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METHODS OF P RE FORM IN G HOMOLOGOUS RECOMBINATION BASED 
MODIFICATION OF NUCLEIC ACIDS IN RECOMBINATION DEFICIENT CFI I « 
AND USE OF THE MODIFIED NUCLEIC ACID PRODUCTS THEREOF ' 

GOVERNMENTAL SUPPORT 

5 rhe research ,eadin S to che Present invention was supported, at least in pan, by a grant from 
the National Science Foundation Grant No. MCB-93 16625. Accordingly, the Government 
may have certain rights in the invention. 

FIELD OF THE INVENTION 

This invention relates generally to methods of modifying genes with specificity in 
0 recombination deficient cells by transiently enabling homologous recombination in the cells. 
Included in the invention are conditional replication shuttle vectors which bestow transient 
recombination capabilities to an otherwise recombination deficient cell. The independent 
origin based cloning vectors containing the modified genes and methods of using the 
independent origin based cloning vectors containing the modified genes are also included in 
5 the present invention. 



BACKGROUND O F THE INVENTION 

Functional analyses of genes /„ vivo frequently involve the introduction of modified genomic 
DNA into the germline to generate transgenic animals [Jaenisch et al.. Science 240: 1468 
(1985); Bnnster, Cell 41:343 (1985)]. The genomic DNA sequences containing mtrons and 
3 essential regulatory sequences have been shown to be expressed in vivo in cases where 

simple cDNA constructs cannot be expressed [Bnnster et al, Proc.Natl.Acad.Sci. 85:836-840 
(1988)]. Furthermore, the size of the genomic DNA that can be readily manipulated in vitro 
and introduced into the germline can be a critical determinant of the outcome of the 
functional analysis of a gene since elements that are important for high level, tissue specific 
and position-independent expression of the transgene may be located at a long distance from 
the gene itself [Dillon et al.. Trends Genet. 9:134 (1993); Kennison. Trends Genet. 9:75 
(1993); Wilson etal, Annu.Rev.Cell.Biol. 6:679 (1990)]. 

On the other hand, the use of such large genomic transgenes has several practical problems. 
For example, the size of the transgene is presently limited due to constraints on the sequence 
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length that can be cloned and stably maintained in a conventional plasmid or a cosmid. Thus 
DNA sequences suspected of being nonessential are often omitted when designing the 
constructs to be transferred because of the size limitation. In addition, in vitro manipulations 
of large DNAs oftentimes lead to mechanical shear [Peterson et al.. TIG 13:61-66]. 

5 Yeast artificial chromosomes (YACs) allow large genomic DNA to be modified and used for 
generating transgenic animals [Burke et al.. Science 236:806; Peterson et al.. Trends Genet. 
13:61 (1997): Choi, et ui.Nat. Genet.. 4:1 17-223 ( 1993). Davies. et al.. Biotechnology 
11:91 1-914 (1993). Matsuura. et al.. Hum. Mol. Genet.. 5:45 1-459 ( 1996), Peterson et al.. 
Proc. Natl. Acad. Sci., 93:6605-6609 (1996); and Schedl, eta/.. Cell, 86:71-82 (1996)]. 
10 Other vectors also have been developed for the cloning of large segments of mammalian 
DNA. including cosmids. and bacteriophage PI [Sternberg et al.. Proc. Natl. Acad. Set. 
U.S.A.. 87: 103-107 (1990)]. YACs have certain advantages over these alternative large 
capacity cloning vectors [Burke et al.. Science, 236:806-812 (1987)]. The maximum insert 
size is 35-30 kb for cosmids, and 100 kb for bacteriophage PI, both of which are much 
5 smaller than the maximal insert for a YAC. However, there are several critical limitations in 
the YAC system including difficulties in manipulating YAC DNA, chimerism and clonal 
instability [Green et al.. Genomics. 11:658 (1991); Kouprina et al.. Genomics 21:7 (1994); 
Larionov et al.. Nature Genet. 6:84 (1994)]. As a result, generating transgenic mice with an 
intact YAC remains a challenging task [Burke et al.. Science 236:806; Peterson et al.. Trends 
0 Genet. 13:61 (1997)1. 

An alternative to YACs are E. coli based cloning systems based on the E. coli fertility factor 
that have been developed to construct large genomic DNA insert libraries. They are bacterial 
artificial chromosomes (BACs) and P-l derived artificial chromosomes (PACs) [Mejia et al., 
Genome Res. 7:179-186 (1997); Shizuya et al.. Proc. Natl. Acad. Sci. 89:8794-8797 
* ( 1 992);Ioannou et al. . Nat. Genet. , 6:S4-89 ( 1 994); Hosoda et al. , Nucleic Acids Res. 18:3863 
(1990)]. BACs are based on the E. coli fertility plasmid (F factor); and PACs are based on 
the bacteriophage PI. The size of DNA fragments from eukaryotic genomes that can be 
stably cloned in Escherichia coli as plasmid molecules has been expanded by the advent of 
PACs and BACs. These vectors propagate at a very low copy number (1-2 per cell) enabling 
genomic inserts up to 300 kb in size to be stably maintained in recombination deficient hosts 
(most clones in human genomic libraries fall within the 100-200kb size range). The host cell 
is required to be recombination deficient to ensure that non-specific and potenfially 



WO 98/59060 

PCT/US98/I2966 

3 

deleterious recombination events are kept to a very minimum. As a result, libraries of PAC 
and BACs are relatively free of the high proportion of chimeric or rearranged clones typical 
m VAC libraries. [Monaco eta!.. Trench Biotechnol 12:280-286 (1994): Boyseu etal.. 
Genome Research. 7:330-333 ( 1 997)]. I n addition, isolating and sequencing DNA from 
> PACs or BACs involves simpler procedures than for YACs. and PACs and BACs have a 
higher clon.ng efficiency than YACs [Shizuya et al. Proc. Natl. Acad. Sci. 89:8794,8797 
( I992);loannou et al.. Genet.. 6:84-89 (1994); Hosoda et at.. Nucleic Acids R es . 18:386. 
( ! 990,]. Such advantages have made BACs and PACs important tools for physical mapping 
m many genomes [Woo et al. . Nucleic Acids Res. , 22:4922 ( 1 994); Kim et al. , 
0 ProcNatl.Acad.Sci. 93:6297-6301 (1996): Wang etal.. Genomics 24:527 ( 1994); Wooster et 
at.. Nature 378:789 ( .995)]. Furthermore, the PACs and BACs are circular DNA molecules 
that are readily isolated from the host genomic background by classical alkaline lys.s 
[Bimboim ei al.. Nucleic Acids Res. 7:1513-1523 (1979], 

Functional characterization of a gene of interest contained by a PAC or BAC clone generallv 
> entails transferring the DNA into a eukaryotic cell for transient or long-term expression. A * 
transfechon reporter gene, e.g., a gene encoding IacZ, together with a selectable marker, e.g.. 
neo. can be inserted into a BAC [Mejia et al., Genome Res. 7.179-186(1997). Transfected" 
cells can be then detected by staining for X-Gal to verify DNA uptake. Stably transformed 
cells are selected for by the antibiotic G41 S. 

However, while PACs and BACs have cloning capacities up to 350kb, performing 
homologous recombination to introduce inutat.ons into a gene of interest has not been 
demonstrated [Peterson et al., TIG 13:61-66]. Indeed, although BACs or PACs have become 
an important source of large genomic DNA in genome research, there are still no methods 
available to modify the BACs or PACs. Furthermore, no germlme transmission of intact 
BACs or PACs in transgenic mice have been reported. These, as well as other disadvantages 
of BACs and PACs greatly limit their potential use for functional studies. Therefore, there is 
a need for an improved cloning vector for germline transmission of selected genes in 
transgenic animals. More particularly there is a need for a clon.ng vector that has the 
capacity to contain greater than 100 kilobases of DN A. which can be readily manipulated and 
-solated. but still can be stably stored ,n library rclauvdy free of rearranged clones. In 
addition, there is a need to prov.de methodology tor generating such cloning vectors. There 
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is also a need to apply such vectors to improve current technologies such as gene targeting. 

Gene targeting has been used in various systems, from yeast to mice, to make site specific 
mutations in the genome. Gene targeting is not only useful for studying function of proteins 
5 in vivo, but it is also useful for creating animal models for human diseases, and in gene 
therapy. The technique involves the homologous recombination between DNA introduced 
into a cell and the endogenous chromosomal DNA of the cell. However, in the vertebrate 
system, the rate of homologous recombination is very low. as compared to random 
integration. The only cell line that allows a relatively high homologous recombination rate 

10 and maintains the ability to populate the germline is the murine 129 embryonic stem cells 
(ES cells). Using this specialized cell, mice can be generated with a targeted mutation 
(Gene Targeting, a practical approach Ed. by A. Joyner, IRL Press: Oxford, New York, 
Tokyo). However, the rate of homologous recombination for some gene loci in ES cells is 
still extremely low (< 1 %), the procedure is labor intensive, and the cost of generating 

15 targeted mutant mice is very expensive. Moreover, since there are no ES cells available for 
vertebrates other than mice, gene targeting in a germline is still not possible for other 
vertebrates. 

The major limitation for gene targeting in vertebrate cells remain to be the low targeting 
frequency. One critical factor affecting the targeting frequency is the total length of 

20 homology. Deng and Capecchi (MCB, 12:3365-3371) have shown that gene targeting 
frequency is linearly-dependent on the logarithm of the total homology length over 
homology lengths of 2.8kb to I4.6kb. Since the curve did not plateau at the 14.6kb 
homology, it is likely that incorporating greater homology lengths into the targeting vector 
will further increase the homologous recombination rate. Using a mathematical model 

25 developed by Fujitani et al, [Genetics, 140:797-809, (1995)], an estimate can be made that 
with a total homology of lOOkb isogenous DNA (i.e., DNA from the same strain of mice), 
the gene targeting rate in ES cells would be 10%. This is a dramatic improvement over the 
conventional 14.6kb targeting vector, which only yields a corresponding rate of 
only 0.03%. Further support for the present strategy Le.„ using a large DNA construct for 

30 gene targeting rate comes from an experiment with Mycobacterium tuberculosis, the causal 
agent of tuberculosis. Like vertebrate cells, gene targeting in TB has a very low rate, 
mainly due to the predominance of random integration over homologous recombination. It 
has been demonstrated that using a 40-50 kb linear targeting construct, a 6% targeting 
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frequency could be obtained, whereas no targeting event was obtained at all with a smaller 
(< iOkb) targeting construct [Balasubramanian et al., J. of Bacteriology 178:273-279 
(1996)]. Therefore, there is a need to construct large gene targeting constructs to allow 
efficient gene targeting in many biological systems. 

:> The citation of any reference herein should not be construed as an admission that such 
reference is available as "Prior An" to the instant application. 

SUMMARY OF THE INVENTION 

The present invention provides a novel and efficient method of modifying independent oricin 
based cloning vectors for in vitro and in vivo gene expression. In its broadest embodiment. 

10 the present invention provides a method of selectively performing homologous 

recombination on a particular nucleotide sequence contained in a recombination deficient 
host cell, i.e.. a cell that cannot independently support homologous recombination. The 
method employs a recombination cassette which contains a nucleic acid that selectively 
integrates into the particular nucleotide sequence when the recombination deficient host cell 

15 is induced to support homologous recombination. The method comprises introducing the 
recombination cassette into the recombination deficient host cell, and inducing the 
recombinantly deficient host cell to transiently support homologous recombination, thereby 
allowing the nucleic acid to integrate into the particular nucleotide sequence. In a preferred 
embodiment, unselected nucleotide sequence rearrangements and deletions, which are 

!0 characteristic of host celis that support homologous recombination, are not evident with 
restriction endonuclease digestion map analysis with a restriction enzyme such as Hindllh 
EcoRl, XtioL or ^vrll. In a more preferred embodiment, unselected nucleotide sequence 
rearrangements and deletions are not evident with restriction endonuciease digestion map 
analysis with two or more restriction enzymes. 

5 In a particular aspect of the present invention, the recombination deficient host cell cannot 
independently support homologous recombination because the host cell is RecA\ In this 
aspect of the invention, inducing the host cell to transiently support homologous 
recombination comprises inducing the transient expression of a RecA-like protein in the hosi 
cell. In a preferred embodiment, inducing the transient expression of the RecA-like protein 

0 can be performed with a conditional replication shuttle vector. In a more preferred 
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embodiment the conditional replication shuttle vector is a temperature sensitive shuttle 
vector (TSSV) that replicates at a permissive temperature, but does not replicate at a 
non-permissive temperature. 

In one particular embodiment of this ape. inducing the transient expression of the RecA-like 
5 protein comprises transforming the host cell with the TSSV at a permissive temperature, and 
growing the host cell a: a non-permissive temperature. The TSSV encodes a RecA-like 
protein that is expressed in the host cell and supports the homologous recombination between 
a nucleic acid contained in a recombination cassette and the particular nucleotide sequence 
contained in the host cell. The TSSV encoding the RecA-like prote.n is diluted out when the 
0 host cell is grown at the non-permissive temperature. In one particular embodiment of this 
type the permissive temperature is 30°C and the non-permissive temperature is 43 °C. 

In a more intricate version of the present invention, the particular nucleotide sequence which 
has been selected to undergo homologous recombination is contained in an independent 
origin based cloning vector (IOBCV) that is comprised by the host cell, and neither the 
5 independent origin based cloning vector alone, nor the independent origin based cloning 
vector in combination with the host cell, can independently support homologous 
recombination. In a particular embodiment of this type both the independent origin based 
cloning vector and the host cell are RecA'. and inducing the host cell to transiently support 
homologous recombination comprises inducing the transient expression of the RecA-like 
protein to support homologous recombination in the host cell. In one particular embodiment 
the independent origin based cloning vector is a Bacterial or Bacteriophage-Derived 
Artificial Chromosome (BBPAC) and the host cell is a host bacterium. 

In a preferred embodiment, inducing the transient expression of the RecA-like protein is 
performed with a conditional replication shuttle vector that encodes the RecA-like protein. 
In a more preferred embodiment the conditional replication shuttle vector is a temperature 
sensitive shuttle vector (TSSV) that replicates at a permissive temperature, but does not 
replicate at a non-permissive temperature. In one particular embodiment of this type the 
permissive temperature is 30°C and the non-permissive temperature is 43 C C. 

In one embodiment the RecA-like protein is controlled by an inducible promoter and the 
transient expression of the RecA-like protein is achieved by the transient induction of the 
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inducible promoter in the host cell. In another embodiment, the RecA-like protein is 
controlled by a constitutive promoter with the transient expression induced by the TSSV. 

In a preferred embodiment the TSSV also comprises a recombination cassette and a first 
5 gene which bestows resistance to a host cell that contains the TSSV against a first toxic 
agent. In addition, the first gene can be counter-selected against. The recombination 
cassette, the RecA-like protein gene, and the first gene are linked together on the TSSV such 
that when the nucleic acid integrates (i.e. resolved) into the particular nucleotide sequence, 
the RecA-like protein gene and the first gene remain linked together, and neither the 
1 0 RecA-like protein gene nor the first gene remain linked to the integrated nucleic acid. 

In a particular embodiment of this type, the independent origin based cloning vector is a 
BBPAC and the host cell is a bacterium. The BBPAC further contains a second gene that 
bestows resistance to the host cells against a second toxic agent. Introducing the 
recombination cassette into the host cells is performed by transforming the host cell w ith the 
1 5 TSSV. Inducing the transient expression of the RecA-like protein to support homologous 
recombination comprises: (i) incubating the host cells at a permissive temperature in the 
presence of the first toxic agent and the second toxic agent, wherein transformed host cells 
containing the TSSV and the BBPAC are selected for and wherein the RecA-like protein is 
expressed. A first homologous recombination event occurs between the recombination 
0 cassette and the particular nucleotide sequence forming a co-integrate between the TSSV and 
the BBPAC, wherein the TSSV is either free or part of a co-integrate; (ii) incubating the 
transformed host cells at a non-permissive temperature in the presence of the first toxic agent 
and the second toxic agent, wherein host cells containing a TSSV co-integrate are selected 
for, and wherein free TSSV cannot replicate; (iii) selecting a host cell containing a co- 
5 integrate between the TSSV and the BBPAC by Southern analysis; (iv) incubating the host 
cells containing a co-integrate between the TSSV and the BBPAC at a non-permissive 
temperature in the presence of the second toxic agent, wherein a second homologous 
recombination event occurs between the recombination cassette and the particular nucleotide 
sequence, therein integrating the nucleic acid into the particular nucleotide sequence and 
3 forming a resolved host cell, i.e., a host cell containing a resolved BBPAC; and (v) 

incubating the host cells containing the resolved BBPAC in the presence of the second toxic 
agent, and a counter-selecting agent, and wherein the counter-selecting agent is toxic to host 
cells containing the first gene, and wherein host cells containing the RecA-like protein gene 
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are amoved. Another embodiment funher comprises selecting a host cell containing the 
resolved BBPAC by colony hybridization with a labeled probe that binds to a DNA 
homologue of the nucleic acid, an mRNA homologue of the nucleic acid, and/or a protein 
encoded by the nucleic acid. In a particular embodiment, the permissive temperature is 
30 C the non-permissive temperature is 43 C C. In a preferred embodiment the incubating of 
host cells containing the resolved BBPAC in the presence of the second toxic agent and 
counter-selecting agent is performed at 37°C. 

Preferred embodiments further comprise the generating of the recombination cassette by 
placing a first genomic fragment 5' of the specific nucleic acid that is to selectively integrate 

10 into the particular nucleotide sequence, and placing a second genomic fragment 3' of the 
specific nucleic acid. The first genomic fragment corresponds to a region of the particular 
nucleotide sequence that is 5' to the region of the particular nucleotide sequence that 
corresponds to the second genomic fragment. Thus, both the first genomic fragment and the 
second genomic fragment contain portions of the particular nucleotide sequence. In one such 

15 embodiment, both the first genomic fragment and the second genomic fragment contain 250 
or more basepairs of the particular nucleotide sequence. In a preferred embodiment, the first 
and second genomic fragments are about the same size. In another embodiment, both the 
first genomic fragment and the second genomic fragment contain 500 or more basepairs of 
the particular nucleotide sequence. In still another embodiment, both the first genomic 
70 fragment and the second genomic fragment contain 1000 or more basepairs of the particular 
nucleotide sequence. In one particular embodiment the recombination cassette is generated 
m a building vector and the recombination cassette is subsequently transferred to the T3SV. 
In a particular embodiment the first gene confers tetracycline resistance and the counter- 
selecting agent is fusaric acid. In a preferred embodiment the RecA-like protein is recA. In 

5 the more preferred embodiment the TSSV is pSVl.RecA having the ATCC no. 97968. 

In a related aspect of the present invention the RecA-like protein is controlled by an 
inducible promoter, and the transient expression of the RecA-like protein is achieved by the 
transient induction of the inducible promoter in the host cell. In one embodiment of this 
type, the independent origin based cloning vector is a BBPAC and the recombination 
0 deficient host cell is an E. coli bacterium. In a preferred embodiment the RecA-like protein 
is recA. 
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The present invention also provides a conditional replication shuttle vector that encodes a 
RecA-like protein. In one such embodiment the RecA-like protein is controlled by an 
inducible promoter. In a preferred embodiment the conditional replication shuttle vector is a 
temperature sensitive shuttle vector (TSSV). The RecA-like protein of the TSSV can be 
controlled by either a constitutive promoter or by an inducible promoter. In one embodiment 
the TSSV contains a gene that can be counter-selected against. In a specific embodiment of 
this type the TSSV contains a gene that confers tetracycline resistance. In another 
embodiment the TSSV contains a RecA-like protein that is recA. In still another 
embodiment the TSSV contains both a gene that confers tetracycline resistance and a RecA- 
0 like protein that is recA. In a preferred embodiment the TSSV is pS VI. RecA having the 
ATCC no. 9796S. 

The present invention also provides an independent origin based cloning vector that contains 
a particular nucleotide sequence that has undergone homologous recombination with a 
conditional replication shuttle vector in a RecA- host cell, wherein the conditional replication 

5 shuttle vector encodes a RecA-like protein. In one such embodiment the particular 
nucleotide sequence is part of the gene that encodes the murine zinc finger gene, RU49 
which is contained by the independent origin cloning vector. In one preferred embodiment 
the independent origin based cloning vector has undergone homologous recombination with 
a temperature sensitive shuttle vector in a RecA- host cell, wherein the temperature sensitive 

0 shuttle vector encodes a RecA-like protein. In another embodiment the independent origin 
based cloning vector is a BBPAC. and more preferably a BAC. In a specific embodiment of 
this type the independent origin based cloning vector has undergone homologous 
recombination with a temperature sensitive shuttle vector that is pSVl.RecA having the 
ATCC no. 97968. 

The present invention also provides methods of using the modified independent origin based 
cloning vectors of the present invention to make transgenic animals, perform gene targeting, 
or perform gene therapy. The independent origin based cloning vectors or linearized nucleic 
acid inserts derived from the IOBCVs, for example, can be introduced into a eukaryotic cell 
or animal. 
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In one such embodiment the eukaryotic celi is a fertilized zygote. In another embodiment the 
eukaryotic cell is a mouse ES cell. The gene targeting can be performed to modify a 
particular gene, or to totally disrupt the gene to form a knockout animal. 



In this aspect or tnc present invention, the independent origin based cloning vector contains a 
:> nucleic acid that has undergone homologous recombination with a conditional replication 
shuttle vector in a RecA" whole cell, in which the conditional replication shuttle vector 
includes a RecA like protein. In a preferred embodiment the independent origin based 
cloning vector is a BBPAC. In a more preferred embodiment, the BBPAC has undergone 
homologous recombination with a TSSV. In the most preferred embodiment, the BBPAC 
10 has undergone homologous recombination with the TSSV that is pSVLRecA having the 
ATCC no. 9796S. 

One particular embodiment is a method of using the BBPAC to introduce the nucleic acid 
into an animal to make a transgenic animal comprising pronuclear injecting of the BBPAC 
(or a linearized nucleic acid insert derived from the BBPAC) into a fertilized zygote. In one 
i3 embodiment the animal is a mammal. In a more preferred embodiment the mammal is a 

mouse. In a specific embodiment of this type the independent origin based cloning Vector is 
a BBPAC and the fertilized zygote is a C57BL/6 mouse zygote. In a preferred embodiment 
of this type two picoliters (pi) of less than one ug/ml BBPAC DNA is injected. In a more 
preferred embodiment 2pl of 0.6 ug/ml of DNA is injected. 

20 The.present invention also includes a method of using the BBPAC of the invention to- 
perform gene targeting in a vertebrate cells comprising introducing the BBPAC into the 
vertebrate cell wherein the nucleic acid that has undergone homologous recombination with 
the conditional shuttle vector, undergoes homologous recombination with the endogenous 
chromosomal DNA of the vertebrate cell. In preferred embodiments of this type the 

25 vertebrate cell is a mammalian cell. In a more preferred embodiment of this type the 

mammalian ceil is a human cell. In a related embodiment the vertebrate cell is a fertilized 
zygote and the nucleic acid contains a disrupted gene. In a preferred embodiment the 
conditional shuttle vector is a TSSV. In a more preferred embodiment the TSSV is 
pSVLRecA having the ATCC no. 97968. 
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The present invention also contains kits for performing homologous recombination on 
selected nucleotide sequences contained on an independent origin based clomng vector, such 
as a BBPAC. In one particular embodiment, the kit comprises a conditional replication 
shuttle vector and a building vector. In a preferred embodiment of this type, the kit further 
5 contains a restriction map for the shuttle vector and/or a restriction map for one or more of 
the bu.lding vectors. In a more preferred embodiment, the kit further includes a protocol for 
using the contents of the kit to perform homologous recombination. 

A particular embodiment of the kit contains a TSSV. such as pSVl.RecA and a buildin* 
vector. In one such embodiment the building vector is pBV.IRES.LacZ.PA. In another such 
0 embodiment the building vector ,s pBV.EGFPl. In yet another such embodiment the 

building vector is P BV.IRES.EGFP I . In still another such embodiment the bui.d.n, vector is 
pBV.pGK.Neo.PA. 

In a preferred embodiment two or more building vectors are included in the kit. In a more 
preferred embodtment all four of the above-listed building vectors are included in the kit. 
- Restriction maps for one or more of the building vectors or the TSSV may also be included 
in the kits. In addition, the kits may also include a protocol for using the contents of the kit 
to perform homologous recombination. In one specific embodiment, a kit contains 
pSVl .RecA and one or more of the above-listed vectors also contains fusanc acid and/or 
chloro-tetracvciine. 



Accordingly. ,t is a P r,nc, P a. object of the present invention to provide a method for readiiv 
and specifically modifying an independent ongin based cloning vector in a recomb.nation 

deficient host cell. 

It is a further object of the present invention to provide a method of transiently expressing a 
RecA-like protein ,n a RecA" host cell to allow the specific modification of a gene of interest 
contained by an independent origin based cloning vector. 

It is a further object of the present invention to provide a method of generating deletions. 
subst.tut.ons, and/or point mutations in a specific gene contained by the independent ong.n 
based cloning vector in a RecA' cell. 
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It is a further object of the present invention to provide a conditional replication shuttle 
vector which encodes a RecA-like protein, and which further contains a specific nucleic acid 
in a recombination cassette that selectively undergoes homologous recombination with an 
independent origin based cloning vector when both vectors are present in a recombination 
5 deficient host cell. 



ft is a runher object of the present invention to provide a temperature dependent shuttle 
vector which encodes a RecA-Iike protein. 

It is a further object of the present invention to provide a temperature dependent shuttle 
vector which encodes a RecA-like protein, which further contains a specific nucleic acid in a 
10 recombination cassette that can selectively undergo homologous recombination with a gene 
of interest contained by an independent origin based cloning vector, when both vectors arc 
placed in a recombination deficient host cell. 

It is ?. further object of the present invention to provide a temperature sensitive shuttle vector 
that is pSVLRecA having the ATCC no. 97968. 

5 It is a further object of the present invention to provide a modified independent origin based 
cloning vector that can be used for the pronuclear injection of a nucleic acid contained by 
IOBCV into an animal zygote. 



It is a further object of the present invention to provide a modified independent origin based 
cloning vector that can be transfected into an embryonic stem cell. 

It is a further object of the present invention to provide a method of introducing a linearized 
nucleic acid insert from a modified independent origin based cloning vector into a fertilized 
zygote of an animal. 



It is a further object of the present invention to provide a method of introducing a modified 
independent origin based cloning vector into an embryonic stem cell. 

5 It is a further object of the present invention to provide a method of purifying a larce 
linearized BBPAC 
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These and other aspects of the present invention will be better appreciated by reference to the 
following drawings and Detailed Description. 

BRIEF PFSCRfPTlDM o f thf dr a wrMr.g 

Figure 1 shows the strategy for targeted BAC modification. (I) Two cloning steps are 
> .nvolved in constructing the shuttle vector. The recombination cassette <«enomic fragments 
A and B: and IRES-LacZP-Poly A marker gene) is first constructed ,n the building vector and 
then subcloned into the temperature sensitive pSVl RecA shuttle vector. (II) Co-integra.e 
jonnanon: Co-integrates can be formed through homologous recombination at either the 
homology A or the homology B site, with only the former case illustrated. (Ill) Resoluiion- 
0 Resoked BACs are selected by growth on plates containing fusaric acid and 

chloramphenicol Correctly reso,ved«lones are identified by co.ony hybridizations with an 
insert specific probe (e.g., a PGK polyA probe). 

Figure 2 shows a schematic representation of targeted modifications of the BAC 169 which 
contains the murine zinc finger gene. RU49. BAC169 containing RU49 was obtained from 
screening of the mouse 129 strain BAC genomic DNA library (Research Genetics) Fieure 
2A depicts a restriction map of the BAC169. The position of several exons are shown" The 
region of homology Al (Ilcb PCR fragment) and homology Bl (I.6kb AWW fragment) 
are indicated. Abbreviations: Xhol (Xh). EcoRl (R). HincIlU (H). Xbal (X), Notl (Not) and 
Pmel (Pme). Figure 2B depicts a map of the modified BAC169 with IRES LacZ PolyA 
.nsertion (BAC169. ILPA). An extra /Wl site is inserted with the marker gene (asterisk) 
The s.ze of the two Pme-Not fragments and the Pmel fragment are mdtcated Since the 
marker gene (4kb) is less than the deleted genomic region (7kb). the total size of the 
modified BAC ( 1 28kb) is smaller than the original B AC ( 1 3 1 kb). 

Figure 3 show, Southern blot analyses of BAC co-integrates and resolved BACs. F.gure 3A 
shows a schematic represemat.on of expected Southern blot fragments in BAC169 , n co- 
mtegrates through homology B 1 . and in correctly resolved BACs. When analyzing 
recombination through homology; Bl, an EcoRl digest is used and homology B 1 is used as 
the probe: when analyzing the recombination through homology Al, a Hindlll digest is used 
and the homology A 1 ,s used as probe. Figure 3B shows homology B 1 co-inte^te, The 
EcoRl digest of BAC Cones and controls are probed with homology Bl. 1-4 represent four 
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clones. BAC 169 and pSVl with the recombination cassette were used as controls. Figure 
3C shows the analyses of the 5' ends of the resolved BACs. Resolved BAC clones (1-S) 
were digested with Hi>uIU\ and probed with homology A 1 . The controls are homology B 1 
co-integrates (CI). BAC 169 and the shuttle vector with recombination cassettes. Figure 3D 
5 shows the analyses of the 3' ends of the resolved BACs. The same procedure is used as 

described above except the resolved BAC clones were digested with EcoRl and probed with 
homoloav B 1. 



Figure 4 shows pulsed field ge! electrophoresis analyses of modified 169 with the ILPA 
insertion. DN'A for two independent clones of BAC 169. ILPA (LI and L2) and BAC 169 
were prepared by alkaline lysis, and then digested with Noll, Pmel and Xfwl (in a standard 
buffer supplemented with 2.5 miM spermidine). The digested DNA were separated by pulsed 
field gel electrophoresis (Bio-Rad's CHEF-DRJI, 5 to 15s. 15 hours at 14 C C) and blotted on 
to nitrocellulose filter (Stratagene). The same filter was probed separately with three probes. 
LI and L2 are lacZl and LacZ2 which are independent clones which correspond to clones 1 
and 2 respectively in Figures 3C and 3D. Figure 4 A shows the use of the BAC 169 probe 
which revealed all the restriction fragments. Figure 4B shows the use of the pgkpoly A probe 
which only hybridized to the ILPA insert fragment. Figure 4C shows the use of the A2 probe 
which hybridized to a fragment outside the region of modification. The position of the 
markers are indicated. 



20 Figure 5 shows the production of BAC transgenic mice. Figure 5A depicts purified 

linearized BAC LI 128 kb Not I insert for pronuclear injection. The pulsed field gel is 
probed with pgkpolyA probe. The numbers represent different fractions. The smear below 
the intact fragment represent degradation and undigested DNA. Figure 5B shows Southern 
blot analyses of the founder transgenic mice with the lacZ probe. The tail DNA were 

25 digested with Bam HI and Southern blot analysis was performed. The negative control 
consisted of littermates of Y3, Y7 and Y9 mice. The positive control was a conventional 
transgenic mouse with the lacZ transgene. Figures 5C and 5D show the results of using PCR 
to determine the presence of BAC ends in the transgenic mice. The DNA at each end 
corresponding to the vector sequence is amplified and probed with a third oligonucleotide in 
30 the middie of the fragment. The appropriate size fragment is indicated. The negative 
controls are littermates. The positive control was BAC 169 DNA. Figure 5E shows the 
germline transmission of the lacZ transgene in the Y7 mouse line. Tail DNA from two litters 
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having eight m,ce each were prepared and digested with BamHl. Southern blot analysis was 
performed with the lacZ probe. 

Figure 6 shows the expression of the lacZ transgene in the brain of the Y7 BAC transgenic 
line. P6 mice brain from Y7 transgenic mice (F.gure 6A) and a w, Id type control littJr mate 
5 (Figure 6B > were whole mount stained to reveal W expression in the Y7 cerebellum. Thick 
saggi.al sections (5mm) from Y7 transgenic m.ce were aiso stained for lacZ expression. 
Figure 6C shows the low magnification and Figure 6D shows the high magnification of the 
rectangle area indicated ,n Figure 6C. Expression in the cerebellum, the detate gyrus and the 
lineage of the olfactory bulb are indicated (i.e. SVZ, RMS and the OB). Abbreviation Ce. 
10 cerebellum: SC. superior coll.col.; IC, inferior colliculi: DG, dentate gyrus: VZ. ventricular 
zone: SVZ, subventricular zone: LV, lateral ventricle; RMS, rostral migratory tract; OB. 
olfactory bulb; Co, cortex. 

F.gure 7 , s a schematic diagram containing F.gure 7A which depicts a hypothetical map of a 
gene of interest within a selected BAC; F.gure 7B which depicts the first targeted 
5 modif.cat.on to .ntrcduce the positive selection marker gene; and Figure 7C which depicts 
the second mod.f.cation to delete the promoter of the gene and to generate the short arm. 

Figure 8 is the restriction map of pSVl.RecA. Th.s temperature sensitive shuttle vector ,s 
based on the pMB096 vector originally constructed by M. O'Connor et al. [Science, 
244:1307-1312 (1989)]. 

> Figure 9 is the restriction map of pBV.IRES.LacZ.PA. This vector was modified from the 
pWHIO vector or.ginally constructed by Kim et al. [MCB, 12:3636-3643 (1992)]. 

F.gure 10 is the restr.ction map of pBV.EGFPl . The plasmid is based on pBluescript.KS(+). 
EGFP1 was from Clonetech. 

Figure 1 1 is the restriction map of pBV.IRES.EGFPl. The plasmid is based on the 
pBluescript.KS back bone. EGFPl was from Clonetech. 

Figure 12 is the restriction map of pBV.PGK.Neo.PA. The vector is based on a pBS.KS 
backbone. The pGK.Neo.PA sequences was excised from a pKS.NT vector by digestion with 
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HindHI and BamHI and subcloned into the Hindlll/Bam fragment of the 
pBV.IRES.LacA.PA. 



DEjAILED DFSCR f PTtONi OFTHF Psrv F njt in \f 

The present invention provides a s.mpie method for directly modifying an independent origin 
5 based cloning vector (IOBCV) in recombination deficient host cells including generating ~ 
deletions, substitutions, and/or point mutations in a specific gene contained in the 
independent origin based cloning vector. Such modifications may be performed with great 
specificity. The modified independent origin based cloning vectors of the present invention 
can be used to introduce a modified heterologous gene into a host cell. One specific use of 
1 0 such a modified vector ,s for the production of a germlme transmitted independent origin 
based cloning vector transgenic animal. 

Targeted independent origin based cloning vector modification can be used for functional 
studies in diverse biological systems. The ability to efficiently modify a independent oriem 
based clon,ng vector and generate an lOBCV-transgemc animal has important applications 
a tor functional analyses of genes in vivo. First, modified independent origin based clonine 
vectors can be used to study regulation of genes or gene complexes in transgenic animals" 
such as mice. Since modified independent origin based cloning vectors can be used to study 
gene function /„ vivo, a deletion, substitution and point mutation within a given gene can be 
made in a independent ong.n based cloning vector, and the independent origin based clonmc 
vector containing the modified gene can be reintroduced in vivo in its endogenous expression 
pattern. Furthermore, targeted independent ongin based cloning vector modification can be 
used to create targeted expression of a selected gene, in the expression pattern of another 
gene, without prior knowledge of all of the regulatory elements of the selected gene. An 
important application of this type is targeted expression of the ere recominase for tissue/cell 
type specific gene targeting [Kuhn et aL Science 269: 1427 ( 1 995); Tsien el al. , Cell 87: 1 3 1 7 
( 1 996)]. Finally, modified independent origin based cloning vectors can be used to generate 
large DNA constructs particularly for gene targeting in ES cells and in vivo. 

In one specific embodiment of the present invention the independent origin based cloning 
vector is a Bacterial Artificial Chromosome (BAC.) modified ,n a host E.coli cell. A targeted 
BAC modification system has several advantages over a conventional yeast based 
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modifenon sysKm . F , rs , a modified auMmaiiM|iy reium ^ ncmbm 
ei.c.en, «, af , er mod „ ;c3 ,, on , ensunn? sHb|e mamHnance ^ modif . ed 
os, s, ra ,„. ^ BAC DNA can b e ^ easlly punfied m rclatlve , y ^ . ^ 

•* n ~ h — • — * b - — *» • VAC ^ « L Z 

™rc BAC „o r a rle s ava, labl e from d , ff „ en , spKiM of amma] - 

0- LCRs a „d enhancers, , D dMe dcpen(Jcm an<| _ Mi;sra s . ^ 

rr a „ s ,„„ ess , on(D!!lo „„ a/ >:iM w „ so „ f „ it 

C, : B ltl ,. 6:(;79(l990hBradIeyc „ a/ , WeC _ (j <i997)j 
m od,,,c a ,,„„ can be applled suc „ ssive|y , o dissKt ihKc eicmcms ^ 
™d,.ed BAC may b e uscd to t , eneraK . transgemc amma| ^ p ^ 
.nre^.es in* ,he a„, m a, cdl genonK . ^ , ransgemc amma| Mn ^ • 

r 9 , it ? mammai tDrohan - ai u s - patem n ° 5 ' 5s9 ' 6 ° 4 - ^ °~ 3 , 

•996,. AUema t ,ve Iy such raodlfied BACs or pACs ^ be ^ ^ ^ ^ 
«« ,„ g e„e therapy . ,„ lhe Esamp|e be , ow a modj( . ed bee[] 

0 remonstrated. 

The m ec„od ol o gy of ,he presen, ,„„„,,„„ is V5ry ge „„ a , ^ _ 
T ; ' ~^-9" «• on BACs, s y s Km ,s r ead ly 

^-nbnunon defic.en, E,oi, ,„ addltlo „, the BAC edification e,e mplifie d herein is aiso 
*n*o ,o MammaIian .Ajiifici a ] For eMmpIe . Hamngto „ „ _ 

mc.ude ,he BAC n,od,f,ca,on taugh, by , he presem in ve„.,o„. 

In acco r d a „ C e wUh ,he p res en, .nvenfion the „ raay be emplowd 
b,o „ gy . microb , 0 , ogy , and recomomaw DN , A techn|ques withm 

»c^«, are ex pl a,„ed fu„ y ,„ , hc „ teraIurc . ^ Samferook & ^ 

^,/»,c ton „„ ; ,z»w^., faw . Second Ed ,„ on „, 89)ColdSpnngHarbor ' 
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Laboratory Press. Cold Spring Harbor, New York (herein "Sambrook et al.. 1989-)- DNA 
ao,un g: A Practical Approach. Volumes I and II (D.N. Glover ed. .985); Oligonucleotide 

■;^T U - G3,t ^ I984): ^"»*»**»> [B.D. Ha.es * S,. H.gg.ns eds . 
N 98. ) J: W.^„ [B.D. Hames & S .J. H.gg.ns. eds. ( . 984)]; ,„ /W 

, C ell c ulu.re [R.l Presley, ed. ( 1 986)]; Immobilized Cells And En^nes [IRL Press 
( > 986)j : B. Perbal, A Practical Guide To Molecular Cloning ( 1 984); P.M. Ausubel et al 
(eds.). Current Protocol, m M olecillar Blolo ^ John W| . |ey & ^ ^ ( ^ 

As used here,„ an "IOBCV" is a „ .dependent origin based Coning vector. One «amp,e of 
such a Co n ,n g vector ,s a BBPAC defined be.o,, An IOBCV generally comprises a nuc, eic 
10 aod insert which either , s or contains a gene of interest. 

A "vector" ,s a repl.con. such as p.asm.d, phage or cosm.d. to which another DNA segment 
may be attached so as to br.ng about the replication of the attached segment. A "replicon" is 
any genetic element (e.g.. plasmid. chromosome, v,rus) that functions as an autonomous unit 
of DNA replication in vivo , /.*.. capable of replication under ,ts own control. 

5 As used herein, a -Bacterial or Bacteriophage-Denved Artificial Chromosome" or "BBPAC" 
denotes a vector that is derived from a bactenum or bacteriophage such as a Bacterial 
Artificial Chromosome (BAG) which is an E. coli F element based cloning system a Pl- 
Denved Artificial Chromosome (PAC) or a lambda-based cosmid. In one embodiment the 
BBPAC encodes from 500 to 700 Abases of genomic sequences. In another embodiment 
the BBPAC encodes up to 500 kiiobascs of genomic sequences. In a preferred embodiment' 
the BBPAC encodes between 120 to 1 80 kilobases of genomic sequences. In one particu.a/ 
embodiment the BBPAC encodes .30 ki.obases of genomic sequences. A BBPAC used for 
gene targeting can be referred to as a "BBPAC targeting construct" and contains a nucleic 
acid .nsert comprising the gene targeting construct. 

A "gene targeting construct" as used herein is used interchangeably with "targeting 
construct" and ,s a nucleic actd that when introduced into a cell undergoes homologous 
recombination with the endogenous chromosomal DNA of the cel.. The nucleic acid , 
introduced into the cell to induce a modi!" 



s 

canon ofa particular gene contained on the 



endogenous .ta™™, DNA. i„c,ud,„, ,„ pa ,„ cular MM . , 0 dismp , ,„„ ge „ e „ ^ 

knockout animal. 
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As used herein a recombinant deficient host cell is "Rer-A-" u u u 
cxp-ssaRecA n ^ ec A when the host cell is unable to 

P . a RetA hke protem , inc]uding recA |tsdf . ^ 

;: d ; the s:mpiest case - the - ~ - e Protcin has been 

, : ----- ------ h eRec A -hostce„conta l n S a mu tat,o ni ntherec, 

gene tnat impairs irs function. 

™^T,T pro,e,ns mvolved hom °'° sous rKomb '— - « » 

r«A [Clark e la I.,c n ,,cW^, fe „ 3/ „ A/ ,„.„ w - 0 

exchange between homologous DNA molecules ,„ri f, f u 

^ molecules, and functions in both DNA re D air and 
8 ene„c recomb.nat.on (McKee „ „„_ 7;479 4ss 

..Rep™,- nave been found m eukarvot.c organ , sms and yeas _ ^ ^ 

■ < ^rr"' 93:3 ° 94 " 3098 ° 996,] • R --"^P— -as, are Rad „ 
eukaryo.es (Pcakman M . ^.^^ M: , 02 , 2 ., 0227 (1996)) Pr ° ,e ' n 

!; e " OTin " ^ ' mereS '" ' S ' 8Cne COma '" ed * ° hM1 ««»- or more 
independent ongm based cloning vector . 

As used herein a ..marker'- ,s an ,„d,ca,„r, .bose presence or absence can be used to 
.s.mgu.s tne presence or absence „ f a p ar ,, eular „ ucle , c aa , ^ 

orrespondtng presence or a bsence of a ,ar g er DNA wh.cb contains and/or ,s'„„ked to tbe 

he prote, . and tbus c a „ be more spectficaHy termed a -marker protem" or a -marker gene 
The erm - mar ke,. ( a„d thus marker prote.n or marker gene, ,s meant to be used ^ I 

"2;;:::;:: n ~ prwcms wh as - n — — - - ~ * 

lucuerase, and further inc udes drua resi^nm ™w l. 

a ui u 0 resistant proteins, whose presence or .k Pfl 
solelv r^cr^A a Presence or absence mav not 

soie.y regarded as a means to detect ee„s tha, conta.n the drug res.stance protem; anchor . 
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he genes tha, encode such proteins. However . drug ^ ^ 

corresponding genes can allow tn£ preferemja , ^ of M||s thai coMaio ^ 
gene (or a „er„ a ,,e,y aMow lhc countt , se|eol , on ^ ^ ^ ^ ^ ^ 

_ resent gene, arid , herefore ^ a ^ of ..^ ^ ^ ^ ^ 

> ■vitnin [he present definition of a marker. 

The term - a gene „. hic „ encodes „ marker ^ ^ ^ .^^^ ^ 

term ' marker protetn gene' a „ d d e„o,es . nuc|5|C ac , d ^ ^ ^ ^ 

A "cassette" refers to a segment of DMA tha, can be tnsened into a vector a, specific 
reaction site. The segment of DNA encodes a polypeptide of interest, and the cassette and 
.0 estnetton s„es are destgned to ensure insertion of the cassette ,„ the proper read,„ 2 frame 
for transcription and translation. The present inventton provides a recombination cassette 
•ha, tnciudes two homology fragments interrupted by an insertion, deletion or mutation 
sequence. 

-Heterologous. DNA refers ,„ DNA no , „ aBjrally IocaKd ,„ „ e or m . ^ 

S ..e of the ce„. Preferably, the heteroiogous DNA mdudes a gene foreign ,„ the cell. 

A "nucleic acd molecule" refers to the phosphate ester polymeric form of nbonttcleosides 
adenosine, guanosine. uridine or cyttdine: "RNA molecules", or deoxynbonueleos.des 
(deoxyadenosme. deoxyguanos.ne, deoxythymid.ne. or deoxycyt.dme; "DNA molecules", or 
any phosphoester analogues thereof, such as phosphoroth.oates and thioesters. in either 
stngle stranded form, or a double-stranded helix. Double stranded DNA-DNA. DNA-RNA 
anc RNA-RNA hehces are possible. The term nucleic acd molecule, and ,„ particuiar DNA 
or RNA molecule, refers only ,„ the prtmary and secondaiy secure of the molecule and 
does no, , „ to a „ y particu)ar xmary foms ^ ^ ^ douWe . 5tranded 

DNA found, ,„„,. in linear or crcuiar DNA molecules (.,. restriction fraements, 
plasm.ds. and chromosomes. r„ d.scusstng ,he structure of pantcular double-str'anded DNA 
molecules, sciences may be described herein according to the „„rm a , convention 
only the sequence in the 5 ■ to 3 ■ direction along the nontransenbed strand of DNA (, . ,h/ 
strand having a sequence homologous to the mRN A , A "recombtnan, DNA molecule"' is a 
DNA molecule that has undergone a molecular biological manipulation 
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'Homologous K(m IO ,„ e insert , oi] 

contamed by a firs, wtor lmo «que„ce 
chromosome of a cell The r , "named m second veclor. or a 

of the second vector o- chromo ' VeC '° r ""° 

- ^rr^t.T^ dna — - — * — - 

. «, codon Ihe , < a 2 ^ « <~d bv 

-m.nus , codm 8 se "* ' '™ ,a " 0n ^ « * 3 ' <-*»■, " 

=DN* from e l ^ b "' " S " 0t "™" £d to ' P"*-^ seauences 

ciJiNA trom eukaryotic mRNA "enom.V HMa „ . H^nces, 

» DNA. and even synlhetlc 0^2^ T **" ~^ 

i" a eufaryonc eel, a ooVad ' 8 """"^ * '"'^ f °' 

Transcnp.iona, and rransiauonal conrro, sequences are DN<\ recula.orv 
P~, enhance,, ,e_. and th e ,*e. lh a, p ro v,de ^ , e ~™ " 
0 sequence in a host cell r, u "press.on of a coding 

ce„. ,„ eutarvonc ce„s, Myadenvlauon si^ls are con.ro, sequences 

minimum number of bases or ,, ' d,rect.on) ,„ include .he 

above bac k8 round W ^ » ^ -scr,p„o„ a, ,e,e,s de.eoable 

(convene," " * ' — P ,,o„ lniHBlen Slte 

nea Ior exa mple. by mappme with nuclease S n , 
domains (consensus SM „,„n . . n nuclease S 1 ). as well as protein binding 

< onsensus sequences) responsible for the bind.ng of RNA polymerase. 

A coding sequence is "under the control" of ,ran S ,np„on a i and trans.at.onal , 



0 



WO 98/59060 

PCT/US98/12966 

22 

which is then rrans-RNA spl.ced and translated into the prote,n encoded bv the coding 

sequence. 

A "signal sequence" ,s included at the beginning of the coding sequence of a protein to be 
_ expresseo on the surface of a ce.l. Th.s sequence encodes a s.gnal peptide. N-termina< to the 
> mature polypeptide, that directs the host eel, to translocate the po.ypept.de. The term 
"translation s.gnal sequence" ,s used herein to refer to this sort of sienal sequence 
Translocation signal sequences can be found associated with a variety of proteins native to 
eukaryotes and prokaryotes. and are often funct.ona. ,n both types of organisms. 

A particular nucleotide sequence compns.ng a gene of interest, whether genomic DNA or 
cDNA, can be isolated from any source, particularly from a human cDNA or uenomtc 
horary. In v.ew and ,n conjunction with the present teachings, methods well known ,n the 
art, as described above can be used for obtaining such genes from any source (see e g . 
Sambrook et al., 1989. supra). 



Accordingly, any animal cell potentially can serve as the nucleic acid source for the • - 

molecular cloning of anv selected opn* rw * 

. n> selected gene. The DNA may be obtained by standard procedures 

k^own ,n the art from cloned DNA ( e. g .. a DNA "library"), and preferably is obtained from a 
cDNA library prepared from tissues with high level expression of the prote.n by chem.ca! 
synthesis, by cDNA Coning, or by the Ceng of genomic DNA, or fragments thereof 
punfied from the des.red cell (See, for example, Sambrook et a.., ,989. supra: Glover' D M 
(ed.). ,985. DNA Clon.ng: A Pract.ca. Approach, MRL Press. Ltd., Oxford, U.K Vol I ,1) 
Clones derived from genomic DNA may contain regulatory and intron DNA regions in ' 
addition to coding regions; clones derived from cDNA will not contain .ntron sequences. 

The present invention provides methods for selectively performing homologous 
recombination in a cell that normally cannot independently support homoiogous 
recombination. A specific nucleic acid ,s inserted into a recombination cassette that 
selectively integrates into a particular nucleotide sequence when the recombination deficient 
cell , s transiently induced to support homologous recombination. More particularly the 
present invention allows the integration of a specific nucle.c and into a part.cu.ar nucleotide 
sequence of a gene of interest. The methods prov.ded.bv the present invention minimize the 
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nonspecific nacleot.de sequence rearrangements and de.et.ons, which are characteristic of 
other systems which involve host cells that normally support homologous recombination. 

In one case the specific nucle.c acid can encode an entirely different protein than the gene of 
_ .merest, and the gene of interest may be selected for the tissue speci ficity of its promoter for 
> examp.e for use in generating a transgemc anima.. or ,n a gene therapy protoco.. In one such 
embodiment the rat preproenkephalin gene may be used as the gene of interest since the 
preproenkephalin promoter has been shown to confer brain express.cn and synaptic 
regulation .n transgenic mice. [Donovan et al.. Proc.Natl.Acad.Sci. 89:2345-2349 0 99^>j 
In the Example below, the mur.ne z, nc fi nger gene, RU49 was used as the gene of interest' 
Alternatively, the specific nude, acid can be constructed so as to cause a deliberate and ' 
specific mod,ficat.on ,n the sequence of the gene of interest, for example for inducing a 
change >n the amino acid sequence of the gene product, such as is typically done ,n 
site-directed mutagenesis protocols. 

In one aspect of the present invention, the recombination deficient host cell cannot 
mdependently support homologous recombination because the host cell is RecA" However 
as any person skilled in the art would readily understand, alternative causes for 
recombination deficiency may be rectified by methods that are analogous to those tau-ht by 
the present invention and'or readily apparent ,n view of such teachings. For example 
recombination deficiency may be due to a deficiency of an alternative recombination protein 
such as another Rec protein including recB, recC, recD. and recE [Clark et al Critical 
Revien, ,n Microbiol. 20:125-142 (1994)] which may be manipulated in a manner that is 
analogous to that taught herein for RecA-like proteins. 

In the case of a RecA" host ce,., inducing the host cel. to transiently support homologous 
recombination comprises inducing the transient expression of a RecA-.,ke protein in the host 
cell. Such induction may be performed by expressing a RecA-l.ke prote.n contained by the 
recombination deficient host that is under the control of an inducible promoter. 

In a preferred aspect of the invention inducing the transient expression of the RecA-like 
prote,n ,s performed with a conditional repi.cat.on shuttle vector that encodes the RecA-like 
protem. Conditional replication shuttle vectors can also include P BR322 in a polyA 
temperature-sensitive bacteria, strain. Preferably the conditional replication shutt.e vector is 
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a temperature sensitive shuttle vector fTSSV^ th-r -„~i . • 

uuic vector ( ibt,V) that replicates at a permissive temperature but 

does not replicate at a non-permissive temperature. 

Inducing the transient expression of the RecA-like protein consists of transform the host 
cell with the TSSV at a permissive temperature, and growing the host cell at a non- 
> perm,ss,ve temperature. The TSSV encodes a RecA-like protein that ,s expressed ,n the host 
ceil and supports the homologous recombinat.on between a specific nucleic acid contained ,n 
a recombination cassette and the particu.ar nuc.eot.de sequence contained in the host cell 
The TSSV encoding the RecA-like protein is diluted out when the host cell ,s grow, at the 
non-permissive temperature. 

0 In a more intneate version of the present invention, the particular nucleotide sequence which 
has been selected to undergo homologous recombination ,s contained by an independent 
ongm based cloning vector (IOBCV) that ,s comprised by the host cell, and ne.ther the 
•ndependent ongin based cloning vector alone, nor the independent ongm based cloning 
vector ,n combination with the host cell, can independently support homologous 
> recombination. In a particular embodiment of this type both the independent onem based 
Coning vector and the host cel. are RecA", and inducing the host cel. to transiently support 
homologous recombination comprises inducing the transient expression of the RecA-like 
protein to support homologous recombination in the host cell. The independent ongin based 
cloning vector can be a BBPAC. such as the BAG exempted below and the host cell can be 
a host bacterium, such as E. coli. 

The independent origin based cloning vectors for use ,n the methods of the present invention 
can be obtained from a number of sources. For example, £.co//-based artificial 
chromosomes for human libraries have been desenbed fSh.zuya « */.. Proc. Natl. Acad Sc, 
89:8794-8797 (1992): loannou « In Current Protocob ^ ^ ^ j. 

etal.) 5.15.1-5.15.24 John Wiley & Sons. New York (1996); K,m e< al, Genomics 34 .13- 
218 (.996)]. Libraries of PACs and BAG. have been constructed [reviewed in Monaco « al 
Trends BioteckoL, 12:280-286 (1994)]. that.are readily isolated from the host E enom,c 
background for example by classical alkaline lysis plasmid preparation protocols [Bimboim 
« al., Nucletc Acids Res. 7: .513-1523 (1979)], or alternatively, with the use of a 
nucleobond kit, a boiling Prep, or by cessium gradient (Maniatis. BAG PAG and PI 

l.branes are also available for a variety of species (, S . Research Genetics, Inc. Genome 
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Research. ,„ c .. Te xas A&u has . bac cen.er ,o m ak e . BAC „brarv ft, „ves,oc k and 
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promoter in a host cell, In another embodiment, the constitutive promoter is the endogenous 
E. coli recA promoter. 

The conditional replication shuttle vector should also contain at least one unique cloning site. 
When a building vector is. used for the construction of the recombination cassette as 
:> described below, one unique site is reserved for transferring the recombination cassette 
containing the specific nucleic acid from the building vector to the conditional replication 
shuttle vector. For example a polylinker can be inserted between two specific restriction 
sites to create additional restriction sites that allow cloning of the recombination cassette into 
the conditional replication shuttle vector. In any case the conditional replication shuttle 
10 vector created should minimally contain a recombination cassette comprising the specific 

nucleic acid, (e.g., containing a point mutation, deletion or a marker gene) flanked at both the 
5' and 3' ends by genomic fragments containing about 350 basepairs (e.g. 250 basepairs to 
600 basepairs though less may be sufficient) or more of the gene of interest of the 
independent origin based cloning vector. 

1 5 In certain cases a building vector is used to construct the recombination cassette. Two small 
genomic fragments, each containing about 350 basepairs {e.g. 250 basepairs to 600 basepairs 
though less may be sufficient) or more of the gene of interest are cloned into the building 
vector (e.g., pBVl) in appropriate order and orientation to generate the Hanking regions of 
the recombination cassette. DNA containing a promoter sequence 5' to the specific nucleic 

.0 acid, which in turn is 5' to a polyadenme addition signal sequence, is inserted between the 
two genomic fragments in the proper orientation. The recombination cassette is then 
transferred into the conditional replication shuttle vector (e.g., pSV I. RecA). The 
recombination cassette, the RecA-like protein gene, and the drug resistant gene are linked 
together on the conditional replication shuttle vector such that when the specific nucleic acid 

5 integrates into the particular nucleotide sequence, the RecA-like protein gene and the drug 
resistant gene remain linked together, and neither the RecA-like protein gene nor the drug 
resistant gene remain linked to the integrated specific nucleic acid. In a preferred 
embodiment the conditional replication shuttle vector is a TSSV and the TSSV is pSVLRecA 
having the ATCC no. 97968. 

0 According to the methods of the present invention the conditional replication shuttle vector is 
transformed into a RecA' host cell containing the independent origin based cloning vector. 
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The resolved independent origin based cloning vectors can be further selected for by growm 
the host ceils (e.g.. at 37°C) on plates containing the second drug and an agent that 
counterselects against cells containing the gene resistant to the first drug (e.g.. a gene 
conferring tetracycline resistance may be counter-selected against with fusaric acid). The 
5 resolved independent origin based cloning vector will be either the original independent 
origin based cloning vector or the precisely modified independent origin based cloning 
vector. One method to identify the correctly resolved BAC is to choose 5-10 colonies and 
prepare a mimprep DNA. The DNA can then be analyzed using Southern blots to detect the 
correct targeting events. Alternatively, the desired clones can be identified by colony 
1 0 hybridization using a labeled probe for the specific nucleic acid contained by the 
recombination cassette. Such probes are well known in the art. and include labeled 
nucleotides probes that hybridize to the nucleic acid sequence. Alternatively, a marker 
nucleic acid can be included in the recombination cassette and constructed so as to remain 
with the specific nucleic acid upon integration into the independent origin based cloning 
15 vector. 

The marker can be a marker gene or marker nucleic acid that encodes a marker protein that 
confers a specific drug resistance to the host cell, as exemplified above, against drugs such as 
antibiotics, e.g.. ampicillin, chloramphenicol, and tetracycline, a protein that confers a 
particular physical characteristic to the cells, such as a green fluorescent protein or a 
0 modified green fluorescent protein as described in U.S. Patent 5,625.048. Issued 4/29/97 and 
WO 97/26333 Published 7/24/97 hereby incorporated by reference in their entireties, or an 
enzyme such as luc.ferase. Alternatively, it can be another marker protein including e.g., P- 
galactosidase. 

The methods of homologous recombination of the present invention are selective, and 
: nonspecific nucleotide sequence rearrangements either do not occur, or are essentially 
undetectable by one or more conventional methods of analysis. One such method includes 
pulsed field gel mapping of the modified independent ongin based cloning vector and the 
unmodified independent origin based cloning vector to determine whether any unexpected 
deletions, or insertions or rearrangement were uenerated during the modification procedure. 
In one particular embodiment, the same filter can be probed separately with a probe for the 
whole independent origin based cloning vector, with a probe for the specific nucleic acid, and 
a probe for a region of the gene of interest that has not been modified. A restriction enzyme 
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independent origin based cloning vector of the invention such as but not limited to adenosine 
deaminase (ADA) to treat severe combined immunodeficiency (SCID); marker genes or 
lymphokine genes into tumor infiltrating (TIL) T cells [Kasis et al., Proc. Natl. Acad. Sci. 
U.S.A. 87:473 (1990); Culver et al.. ibid. 88:3155 (1991)]; genes for cloning factors such as 
5 Factor VIII and Factor IX for treating hemophilia [Dwarki et al. Proc. Natl. Acad. Sci. USA. 
92:1023-1027 (19950): Thompson. TJiromh. and Haemostatis, 66:1 19-122 (1991)]; and 
various other well known therapeutic genes such as, but not limited to, p-globin, dystrophin, 
insulin, erythropoietin, growth hormone, giucocerebrosidase. p-gtucuronidase, a -anti trypsin, 
phenylalanine hydroxylase, tyrosine hydroxylase, ornithine transcarbamylase, 
1 0 apolipoproteins. and the like. In general, see U.S. Patent No. 5,399,346 to Anderson et al. 

One particular method comprises the pronuclear injection of the modified independent origin 
based cloning vector into a fertilized animal zygote. Such a method is exemplified below 
with the modified independent origin based cloning vector being a BAC which has been 
linearized, and the animal zygote being a mouse zygote. 2 pi of 0.6 |ig/ml of BAC DNA was 
15 injected. 

The presence of both ends of the modified independent origin based cloning vector can be 
assayed for in the transgenic animal to determine if the intact nucleic acid insert of the 
IOBCV has been integrated into the genome. Since both ends of the nucleic acid insert : 
contain some vector sequence, PCR primers specific to the vector sequence can be generated 
20 and used to amplify the transgenic DNA. The amplified products can then be probed with a " 
third labeled oligonucleotide probe within the amplified region. 

The transgenic animals that are formed give rise to germline transmission after appropriate 
breeding (B6/CBA mice were used in the Example). The ratio of transgenic animals to wild 
type animals should follow Mendelian genetics. 

The expression of the specific nucleic acid and/or the gene of interest inserted into the 
transgenic animal can be determined by a variety of methods well known in the art which 
depend on the nature of the insert. For example, enzymes can be appropriately assayed for 
activity, in the case of P-galactosidase, whole mount staining can be performed, in situ 
hybridization can be used to detect the corresponding mRNA. and specific antibodies can be 
used to identify the expression of a corresponding protein. In preferred embodiments such 
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expression will be evident only in cells in which the endogenous gene of interest is 
expressed. In the Example, in which the gene of interest was the murine zinc finger RU49, 
and the specific nucleic acid inserted therein was the lacZ marker gene, analyses of the 
expression of the lacZ marker gene in the entire'cerebellum of postnatal day 6 transgenic 
5 mice closely resembled the corresponding endogenous RL'49 expression pattern. 

The present invention also provides the use of targeted BBPAC modification to obtain a 
high rate of gene targeting in vertebrates. The BBPAC contains a nucleic acid insert 
comprising the gene targeting construct. The circular BBPAC can be used, or preferably 
the linearized nucleic acid insert is used. In either case, the BBPAC or linearized nucleic 
acid insert can be purified by gel filtration as described herein. 



In one aspect of the invention the gene targeting is performed in ES cells using a BBPAC 
gene targeting construct that is greater than lOOkb. In a general sense, the BBPAC gene 
targeting construct is similar to the conventional positive selection gene targeting construct 
(Figure 7): it contains two regions of homology, a long arm (>80kb) and a short arm (10- 

15 20kb). with the neo cassette (pgk-neo-polyA) introduced into the middle of the BBPAC. 

Two targeted BBPAC modifications are used to make this construct. The first modification 
is to introduce the neo gene to disrupt the gene of interest in the BBPAC. A second 
modification is to create the short arm (10-20kb). The reason for the second modification is 
enable the use of an endogenous probe flanking the short arm (KO probe) to detect a 

20 polymorphism between the targeted allele and the wild type allele in screening ES cell 
clones (Fig 7; Gene Targeting, a practical approach, supra). 

A preferred version of the BBPAC gene targeting methodology of the present invention also 
includes negative selection. The conventional negative selection cassettes, such as the use 
of the herpes thymidine kinase cassette or the diphtheria toxin gene cassette, may not always 
25 work with BBPAC constructs since BBPAC DNA tends to exist in transfected mammalian 
cells as episomal DNA for a long period of time [Baker et al., NAR 25: 1950-1956]. In one 
example, the EGFP1 cassette can be used as a negative screening cassette. In this case, in 
the second step of modification to generate the short arm. the CMV promoter driven green 
fluorescent protein (EGFP-I) and the polyA signal can be introduced. Unlike other 
30 negative selection cassettes, GFP is not toxic to the cells but serves as a fluorescent marker 
protein. When gene targeting occurs, the EGFP-I cassette will be lost and the cell will not 
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exhibit a green fluorescence under UV light. On the other hand, when the BBPAC 
integrates non-homologously, the EGFP-l cassette also integrates, and the cells will 
therefore exhibit the green fluorescence under UV, For the definitive Southern blot 
analyses only those neo resistant cell lines which do not exhibit a green fluorescence under 
UV light are chosen. 

The process of generating the targeted ES cells with a BBPAC targeting construct is 
essentially the same as with the conventional protocols (Gene Targeting, A Practical 
Approach, supra), except for the following steps. First the linearized intact BBPAC nucleic 
acid insert (for example) is purified using the gel filtration procedure described herein. 
Next, the transfection of ES cells with the linearized intact BBPAC nucliec acid insert is 
performed as described by Baker [NAR. 25:1950-1956 (1997)], using psoralen-inactivated 
adenovirus as carriers, for example. 

The method enables transfection efficiency in mammalian cells with linear BBPAC DNA to 
be similar to the transfection efficiency of a conventional DNA construct. On the other 
hand, the BBPAC targeting construct can potentially provide 10-100 fold higher targeting 
frequency than the conventional targeting construct, thereby making gene targeting- in 
mouse ES cells easier and cheaper, since only a few dozen colonies need to be isolated and 
screened to obtain the targeted clones. 

The present invention further provides a method of performing gene targeting in fertilized 
vertebrate zygotes by the injection of a BBPAC targeting construct, or preferably the 
linearized intact BBPAC nucleic acid insert containing the targeting construct to generate a 
transgenic knock-out animal (TKO). A large targeting construct ( > lOOkb) can provide a 
very high targeting rate (predicted by mathematical modeling described above) and gene 
targeting can be directly performed with a fertilized vertebrate zygote via pronuclear 
injection of the modified BBPAC targeting construct. TKO methodology has previously 
been attempted by Brinster et al. [PNAS, 86:7087-91 (1989)] with a small DNA construct 
(2.6-8.9kb) but those workers only obtained a relatively low targeting rate (0.2%). The 
large homology DNA in the BBPAC (> lOOkb) of the present invention increases the 
targeting rate to a favorable range of 2% to lO'c 
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In one such embodiment, the design of the gene targeting construct is similar to the ES cell 
targeting construct except that instead of the neo gene, an IRES-GFP cassette or an IRES- 
lacZ cassette is fused to an exon of the gene of interest to disrupt the gene (Figure 7). As 
described above, two consecutive steps of BBPAC modifications are involved ir. generating 
> the BBPAC containing the gene targeting construct. 

The modified BBPAC TKO construct can be prepared in milligram quantities and linearized 
as described above. The linearized DNA then is introduced into the fertilized zygote by a 
standard protocol, e.g.. pronuclear injection (Hogan etal., (1986) supra). The transgenic 
animal is then identified by standard Southern blots. The gene targeting event can be 

1 0 further identified by digesting DNA of the transgenic animal with appropriate enzymes, 
such as enzyme X. (Figure 7) and probed with the flanking KO probe (Figure 7). Mice 
with the targeting event will have an additional band of the appropriate size. Such gene 
targeting events can further be confirmed by express.on of the GFP or LacZ marker gene in 
the expression pattern of the targeted endogenous gene, since the construct is designed to 

15 trap the endogenous promoter. 

The TKO method has important ramifications in the field of vertebrate genetics. It enables 
gene targeting in many organisms that do not have ES cells, such as zebra fish, rats and 
other mammals. This will help to generate better animal models for human diseases (e.g., 
rats and monkeys), or to create genetically targeted animals suitable for organ transplants 
20 (such as pigs or baboons) or for commercial reasons (e.g. . leaner pork or beef). This 

method also has additional advantages, even for gene targeting in mice. For example, this 
method will automatically provide germline transmission, since transgenic animals are 
rarely chimeric. It also enables targeted mice in strains other than the 129 strain to be 
obtained, and avoids the expensive and time-consuming out-breeding protocols. 

25 In still another aspect of the present invention, methods of performing gene targeting in 
somatic cells using BBPAC targeting constructs are provided. Since gene targeting in 
somatic cells is also dependent on the length of homology, using large DNA targeting 
construct also improves the targeting rate in somatic cells. The experimental design in this 
case is similar to that with the ES cells described above. Somatic cell gene targeting is 

30 useful in gene therapy, for example, in a targeted insertion of a functional sene in a 
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hereditary disease of the hematopoietic system. Such methods are also useful to generate 
targeted cell lines for experimental purposes. 

Conditional replication shuttle vectors that encode a RecA-like protein are also provided bv 
the present invention. The RecA-iike protein can be controlled by either an inducible 
5 promoter or a constitutive promoter. The conditional replication shuttle vector is preferably 
a temperature sensitive shuttle vector fTSSV). In one such embodiment the TSSV contains' 
both a gene that centers tetracycline resistance and a RecA-like prote.n that is recA. In a 
preferred embodiment, the TSSV xspSVl.RecA having the ATCC no. 97968. 

Independent origin based cloning vectors that contain a gene of interest that has been 
10 modified by the methods of the present invention are also included m the present invention. 
More particularly such independent origin based cloning vectors have undergone 
homologous recombination with a conditional replicatton shuttle vector >n a RecA" host cell, 
wherein the conditional replication shuttle vector encodes a RecA-like prote.n. In a preferred 
embodiment the independent origin based cloning vector has undergone homologous ■ 
5 recombination in a RecA' host ceil with a temperature sensitive shuttle vector encoding a 
RecA-like protein. In a preferred embodiment the modified independent origin based cloning 
vector is a BAC that has undergone homologous recombination with the temperature ' 
sensitive shuttle vector pSVl.RecA having the ATCC no. 97968. 

The present invention may be better understood by reference to the following non-limiting 
) Examples, which are provided as exemplary of the invention. The following examples are 
presented in order to more fully illustrate the preferred embodiments of the invention. They 
should in no way be construed., however, as limiting the broad scope of the invention. 

EXAMPT.F 

HOMOLOGOUS R F CO M B TN AT ION Rasfh M ODIFICATION! rv r f w 
GERMLINF TR ANSMISST ON TN T RA NSGFNIP Mirr of , K , \\ , g„ n . ™° 
BACTERIA f A RTrFICTAT CHRQMOSOMF ' MLUHA.Sh 

Introduction 

Bacterial based artificial chromosomes, such as Bacterial artificial chromosomes (BACs) 
and P-lderived artificial chromosomes (PACs). are circular bacterial plasmids that may 
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propogate as large as 300kb of exogenous genomic DNA (Shizuya et al, PNAS. 89. 8794- 
97. 1992: Ioannou et al. Nature Genet.. 6. 84-90, 1994). For the majority of BAC and 
PAC libraries, the average size of the insert is 130-150 kb. There are several advantages of 
usmg bacterial based artificial chromosomes for genomic and functional studies, compared 
> to the yeast based system (i.e. YACs): First, BAC and PAC libraries are much easier to 
construct due to higher cloning efficiency. Second. BACs and PACs are propaeated ,n 
recombination deficient E. coli host cells, so they have high stability and minimal 
chimensm. No rearrangements have been observed in BACs or PACs after 100 
generations of growth. Th.rd. isolation of BAC and PAC DNA is very easv since they 
1 0 exist as supercoiled circu.ar plasmids that are resistant to shearing. Conventional bactenal 
plasmtd DNA isolation methods can be applied to obtain milligrams of intact BAC or PAC 
DNA. Finally, direct DNA sequencing can be applied to BAC or PAC DNA which is not 
possible for YAC DNA. These advantages have made BACs and PACs important tools for 
genome studies in many species. 

1 5 Although BBPACs are usefu. for physical mapping in genome studies, no sim pIe method is 
available to modify BBPACs . as is avai.able for the YACs. A simple homologous 
recombination based BBPAC modification method is disclosed, termed targeted BBPAC 
modification (See Figure 7 for a schematic representation of the method). This method 
allows precise modification, such as marker insertion, deletion, point mutation, at any 
20 chosen site within a given BBPAC. This method involves several steps: isolation of 

BBPACs using cDNA or genomic DNA probes, simple mapping and partial sequence of 
the BBPACs. cloning of the shuttle vector, targeted modifications, pulsed field gel analyses 
or the modified BBPACs, and finally preparation of linearized BBPAC DNA for functional 
stud.es, such as pronuclear injection to produce BBPAC transgenic mice. Since the method 
2. .s s.mple and reliable, it is reasonable to expect that the entire procedure, from the step of 
screening for a BBPAC with a cDNA or genomic DNA probe to the step of modified 
BBPACs ready for functional srudies, can be completed within 6-8 weeks. 

Using this method, the IRES-LacZ marker gene has been introduced into an . 3 I kb bactenal 
artificial chromosome (BAC) containing the murine zinc finger gene, RU49. No 
30 rearrangements or deletions are detected in the modified BACs. Furthermore, transgenic 
m.ce are generated by pronuclear injection of the modified BAC and germline transmission 
of the intact BAC has been obta.ned. Proper expression of the lacZ transgene in the 
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cerebellum has been observed, which could not be obtained with conventional transgenic 
constructs. In summary, a novel and efficient method has been developed to modify BACs. 
PACs and PI for in vivo studies of gene expression and gene function. 



Materi als and iVfefhnrk 
- ' • Isolation and initial mapping of BACs 

(I) BAC isolation (3-4 days): 

A BAC clone is isolated with either a unique cDNA or genomic DNA probe. BAC 
libraries for various species, (in the form of high density BAC colony DNA membrane) are 
available from Research Genetics. Inc. and Genome Research. Inc. The mouse 129 
1 0 genomic BAC library from Research Genetics has proved to be a good source for genomic 
DNAs. To avoid damage to the membrane, the probe is first tested on a mouse genomic 
Southern blot to ensure that the probe does not contain any repetitive elements. The library 
is screened according to manufacture's direction. The positive clones can be obtained from 
the company within a few days. 



5 (II) Preparation ofmidiprep BAC DNA by alkaline lysis method (lday): 

Reagents: 1. Solution I: 50mM glucose, 25 mM Tris.HCl (pH 8.0); 10 mM EDTA 
(pH 8.0) 

2. Solution II: 0.2N NaOH, 1 % SDS (0.4 g NaOH. 45 ml ddH20, 5ml 

10% SDS). 

3 3. Solution III: 5M KOAc (60ml), glacial acetic acid (11.5ml), H20 (28.5 

ml). 

Protocol: 

1) . Inoculate each BAC containing bacterial to 50mi LB containing 12.5 ug/ml 
chloramphenicol. Grow overnight in 37°C. 

2) . Spin the overnight culture in a 50ml Falcon rube for 20 min. at 3500 RPM at 4 
°C. Pour off the supernatant. 

3) . Resuspend the pellet in 1 ml cold solution I. Transfer the cell mix to a 15 ml 
polybrene centrifugation tube and place on ice for 5 min. 

4) . Then add 2ml fresh ( < 2 weeks old) solution II. Mix well by inverting 
vigorously a few times. 
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Biolab's PFGE marker I or II as the high molecular weight marker and Ikb DNA ladder 
(Life Technologies Inc.) as the low molecular weight marker are used. 

The gei is then stained with ethidium bromide (I to 5000, or I to 10.000 dilution of 
lOmg/ml stock) for 30 min prior to taking the photograph. Then the gel is blotted onto the 
5 nitrocellulose membrane and hybridized to cDNA and genomic DNA probes according to 
standard protocols (Maniatis, supra). To ensure the entire cDNA is included in the BAC, 
probes/or oligonucleotides from both the 5'end and the 3' end of the gene are used to probe 
the blot separately. Those large BACs containing the entire gene are usually selected for 
BAC modification. 

10 2. Construction of the shuttle vector with the recombination cassette 

Since targeted BAC modification is a method based on homologous recombination, 
homologous sequence from the BAC has to be obtained. Two homologous sequences of 
about 500bp each (namely A and B, Figure 7) is all that is needed to construct the shuttle 
vector for BAC modification. The homologous sequences are chosen such that a given 

15 modification (i.e. insertion, deletion and point mutation) will be introduced between A and 
B in the BAC. A and B can be obtained by direct sequencing of the BACs. The 
sequencing oligonucleotides are designed based on the cDNA sequence. 



(I) Direct sequencing of the BAC (2-3days)\ 



1) If maxiprep DNA is used, go directly to step 2. If midiprep DNA is used, first 
20 add lOOul ddH20 and lOul lOmg/ml RNAse A to lOOul midiprep BAC DNA. and 

incubate at 37 °C for > Ihr. (This step is critical, incomplete RNAse treatment will 
result in poor precipitation and sequencing). 

2) Add 132 ul PEG mix (2.5M NaCl and 20% PEG 8000) to the treated DNA. Put 
on ice for 5min. 

25 3) Spin for 15 min at 4°C. Discard the supernatant. Spin again for 2 min. 

Completely remove the remaining supernatant, which contains the PEG mix. 

4) Wash the pellet with 70% ethanol. Dry in Speedvac and resuspend in 20ul 
ddH20. 

5) Run 2ul on a agarose gel to estimate the final concentration. Usually use 6-8 ul 
30 (500ng-l000ng) DNA for automatic sequencing, also use 150ng sequencing oiigos. 



BNSDOCID: <WO 9859O60A1 _l_> 



WO «>8/5<)060 



PCT/LS98/I2966 



39 



Each sequencing reaction will result in up to a 500bp sequence. Sequence more than one 
BAC for a given primer to compare the sequences. The main purpose for sequencing is to 
design a 20 bp PCR primer, which is about 500 bp away from the sequencing oligo (which 
usually is the other PCR primer), to enable PCR amplification of this genomic fragment and 
to clone it into the building vector. Therefore, as long as a 20bp sequence can be identified 
which is at the appropriate position, and which is the same in several independent 
sequencing reactions, the goal is achieved. The quality of the DNA sequence in between is 
not very critical. 



(II) . Vectors used in targeted BA C modification : 
10 A two vector system is designed to construct the shuttle vector for BAC modification 

(Figure 1). The first vector is a pBS.KS based building vector, which is used to construct 
the recombination cassette containing homologous sequence A and homologous sequence B 
and the modification to be introduced between them. The recombination cassette was not 
constructed in the pSVl.RecA shuttle vector was for the following reasons: first, it is a low 
1 5 copy plasmid so that it is difficult to obtain high quantity DNA; second, it is a large plasmid 
(1 lkb). so it is relatively difficult to clone. The building vector contains the marker gene to 
be introduced into the BAC. cloning sites flanking it (usually EcoRI for cloning the 
homology A and Xbal for homology B, and rare restriction sites such as Mlul, Pmel and 
Pac I for mapping of the modified BAC). There are two Sal I sites (or one Sal I, one Xhol) 
0 flanking the multiple cloning sites. They are used to release the recombination cassette and 
subclone it into the Sal I site of the pSVl.RecA vector, to complete the shuttle vector 
construction. One thing about designing the building vector is that there should not be any 
Not I sites within the recombination cassette, since Notl sites are used in the end to release 
the linear modified BAC for biological experiment (e.g., pronuclear injection). The map 
5 and utility of various building vectors and the shuttle vector are described below. 

(A) Building Vectors (pBV) All based on pBS.KS (Stratagene) 

pBV.IRES.LacZ.PA (Fig. 9) This vector is designed to introduce lacZ marker gene 
into a coding exon or the 3' UTR of a given gene, to study gene expression and 
gene regulation in vivo. IRES will enable the translation of the marker gene 
independent of the endogenous translation initiation codon. 
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pBV.EGFPl (Fig. 10) This vector is designed to introduce the brighter version of 
the green fluorescent protein, EGFP1 (Clontech), into an exon of a given gene 
before the endogenous ATG or fused in frame with the endogenous gene. The 
green fluorescent protein will mark gene expression in living cells and living 
organisms. Since the marker gene does not contain its own polyA addition 
sequence, the endogenous poly A sequence is used. 

pBVJRES.EGFPX (Fig. 11) This vector is used to introduce EGFPl gene into the 
coding region or the 3' UTR of a given gene, with its translation independent of the 
endogenous translation frame. 

pBV.pGK.Neo.PA (Fig. 12) This vector is designed to introduce a neo expression 
cassette into the BAC. containing the neo gene with the pgk promoter and the 
poly A addition signal. Modified BAC can be introduced into tissue culture cell 
lines (i.e. ES cells) to obtain stable transfected cells by selecting for neomycin 
resistance. This vector is particularly useful for gene targeting with modified 
BACs. Notice that although there are two identical pgkpA sequence at the 3' end 
of the neo gene, it. will not interfere with the proper expression of the neagene. 
The only consequence is that during BAC modification, one of the pgkPA sequence 
may be deleted due to homologous recombination. 

(B) Temperature sensitive, recombination inducing shuttle vector (pSVl .RecA) (Fig. 8) 

This plasmid vector was modified from the pMB096 vector originally constructed 
by O'Connor et al (Science, 1989. Vol 244, pp. 1307-1312). The pMB096 vector 
was a gift from Dr. Michael O'Connor. The original vector carries tetracycline 
resistance, and contains a pSClOl temperature sensitive origin of replication, which 
allows the plasmid to replicate at 30 n C but it will cease replication and is lost at • 
43°C. The E. coli RecA gene was amplified by PCR and sub-cloned into the Bam 
HI site, to create the pSVl.RecA vector. The Sal I site is used to subclone the 
recombination cassette from the building vector. 

(Ill) Cloning rwo PCR amplified BAC fragments into the building vector ( 6-8 days ): 
30 The first step of targeted BAC modification involves the subcloning of two small genomic 
fragments (A and B) into an appropriate building vector, which includes two steps of 
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conventional sub-cloning. One should pay attention to the following points when designing 
the A and B fragments. 

I. Each fragment should be > 500 bp (the shortest attempted was 450bp). PCR 
amplified fragment with appropriate restriction sites designed at die end of the PCR 
3 primer is the method of choice. Frequently, an additional restriction site is 

designed into one of the two PCR primers to assist in determining me orientation of 
the cloned PCR fragment. The relative imprecision of PCR amplification does not 
appear to affect the BAC modification efficiency. 

2. As mentioned before, neither A nor B fragments should containing internal 
Xbal. EcoRI and Sal I sites, since these sites will be used for subcioning. Nor 
should they contain NotI sites since NotI is used to linearize the BAC 

3. The orientation of the arms must be preserved as in the endogenous loci. 
(IV) Subcioning the recombination cassette from the building vector into the pSVl.RecA 
shuttle vector (4 days) : . 

1:> I. Prior to cloning the recombination cassette into the shuttle vector, the following plates 
are usually prepared: the tetracycline (lOug/ml) LB agar plates and the 
tetracycline( lOug/ml) + chloramphenicol (12.5 ug/ml) LB agar plates. Plates are made 
according to standard protocol [Sambrook et aL, (1989) supra], 

2. Prepare pSVl.RecA and building vector midi-prep DNA by the alkaline lysis method ( 
20 see above). For the pSVl.RecA vector, Qiagen columns can also be used to obtain high 
purity DNA. though yield is usually low. This is due to the low copy number of the pSVl 
plasmid. For preparation of pSVl.RecA DNA, the culture should be grown at 30 °C in LB 
+ tetracycline ( lOug/ml). The final midi-prep DNA is usually dissolved in 200 ul TE or 
ddH 2 0. 

25 3. Digest 2-5 ug of the pSVl.RecA and pBV with Sal I . For pSVl.RecA, the reaction is 
done in 200 ul volume: 

100 i±\ medi-prep DNA (2-5 ug) or 
20 fil of Qiagen midi-prep of pSVl .RecA 
20 ^1 H buffer (Boehringer Mannheim) 
8 ul Rnase (10 mg/ml) (for alkaline lysis preps) 
10 ul Sal I (200 units. Boehringer Mannheim) 
62 ul ddH.O 

The reaction is performed at 37 «C for >6 hours (usually overnight), then 30 units more Sal 
I is added, and the digestion continue for another 1 -2 hours. (Optional) A small sample of 
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the digestion (5ul) may be run on a gel to ascertain that a complete digestion has been 
achieved. 

4. (Optional)At the end of the digestion, Sa! I is inactivated by heating to 65 °C for 15 
minutes. 

:> 5. The vector is then created with alkaline phosphatase by adding 20ul iOx 

dephosporylaiton buffer, 4 ul < lunit/ui) calf intestinal alkaline phosphatase (Boehringer 
Mannheim) for 30 minutes at 37 rt C. The enzyme is then inactivated by adding 20 ul 50 
mM EDTA (to a final concentration of 5mM), and heating at 75 °C for 15 minutes. 

6. The digested pSVl vector and pBV with recombination cassette are run on a 1% low 
10 melting Seaplaque GTG agarose at 75 V for 8-10 hours. The DNA should be run in a 

large well created by taping together several teeth of the comb. 

7. An 1 lkb linearized plasmid band should be visible on the gel for pSVLRecA. Cut this 
band and also the recombination cassette insert band from the gel. Purity these DNA 
fragments using Geneclean Spin columns (Bio 101, Inc.) according to manufacture's 

15 direction. Run a small portion of the purified DNA on a gel to estimate the DNA 
concentration. 

8. Ligation reaction: Each ligation reaction is done in 20ul total volume containing: > 50ng 
pSVl. vector, I00-200ng insert, 2ul 10X ligation buffer (Boehringer-Mannheim),-2uI lOmM 
ATP, lul ligase (Boehringer-Mannheim) and ddH20. Ligation is carried out at 16 °C 

20 overnight. 

9. Transformation of DH5a competent cells with pSVl vectors: Half of the ligation 
reaction (10 ul) is used for transformation, by adding to 100 ul of cold, chemical- induced 
DH5a competent cells. Incubate 15 minutes on ice, then heat shock at 37 °C for 2 minutes, 
add 1ml LB to the tube, and shake at 30°C for 30 minutes. The cells are then centrifugated 

15 at 6000 x g for 4 minutes and the pellet is resuspended in 100 ul LB and spread onto Tet 
(lOug/ml) LB agar plates. Incubate the plates at 30 °C for > 15 hrs hours. 
1 1 . Pick colonies and do colony hybridization according to standard protocols [Sambrook 
et aL, (1989), Supra], probing with a fragment derived from the pBVl, such as homology 
arms (A or B)or the marker gene. Positive clones are further analyzed by restriction digest, 

0 and if necessary, Southern blots. 
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3 * lar^eted BAC Hnmnin r „ c P- comhinnrinn i^ ^^ 

(I) Equipment 

Bacterial incubator: set either at 30°C or at 43"C. 
Shakers: set either at 30°C or at 43°C. 
(//) Reagents and Plates 

The following reagents and plates should be prepared prior to the targeted modification 
experiment. All the p.ates can be stored in 4 "C for up to one month. Detai.ed methods for 
preparat.on of various antibiotic resistant plates can be found in Maniatis. 

1. Tetracycline stock solution f iOOOX): 10 mg/ml in 50% ethanol. wrapped in 
aluminum foil and stored in -20°C for up to one month. 

2- Chloramphenicol stock solution ( 1000X): 12.5 mg/ml. dissolved in ethanol 

(>50%). stored in -20 °C. 

3. Tetracycline places (tet plates): LB agar plates containing 10 ug/ml tetracycline. 
Store m 4 °C and wrapped in aluminum foil to avoid the light. 

4. Chloramphenicol plates (Chi plates): LB plates contain 12.5 ug/ml 
Chloramphenicol. 

5. Tetracaine- Chloramphenicol plates: LB plates contain lOug/ml tetracycline 
and 12.5 ug/ml chloramphenicol. 

6. Fusaric acid + Chloramphenicol TB plates (FA + Chl plates): Prepared as 
20 following. 

First, make tryptone broth agar, or TB agar: 

500ml TR 1L TB 



15 





Tap H 2 0(not distilled FLO) 


500 ml 


1L 


25 


Bacto tryptone 


5g 


10g 


Yeast extract 


0.5g 


lg 




Glucose 


0.5g 


lg 




NaCi 


4g 


8g 




0.1M ZnCl, 


0.25 mi 


0.5 ml 


30 


Chlorotetracycline (6.3mg/ml) 


4 ml 


8 ml 


Bacto agar 


7-5g 


15g 
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Autoclaving the above TB. Also autoclave 500 ml of 1M NaH 2 P0 4 .H : 0. After 
autoclave, wait till the TB agar drop to about 60°C, then add the following: 



500ml TB 



1L TB 



NaH : PCK.H : 0 (1M) 

Fusaric Acid (2mg/ml, filter ster.) 



36 ml 



72 ml 



3 ml 



6 ml 



Chloramphenicol (12.5 mg/ml ) 



0.5 ml 



1 ml 



Pour the plates and leave the plates outside overnight and then store at 4 °C. There 



is no need to avoid the lieht. 



1 0 (I I I) Making competent BAC containing bacteria (lday): 

A chemical method is used to prepare competent cells from BAC containing bacteria host 
(Inoue et al. Gene 96, p23-28, 1990). 
(1) Media and plates: 



20 (2) Frozen stock of BAC containing DH10B cells were taken by a metal loop and 

inoculated into 3 ml of LB-+-chloramphenicol (12.5ug/ml). Grow the culture with rigorous 
shaking in 37 °C for overnight. 

(3) Take 0,5ml overnight culture, add to 50mi LB-hchioram. (12.5ug/mi) and grow at 37 
°C with rigorous shaking till an optical density at 600 nm of about 0.6 is achieved. 
25 (4) Place the flask on ice for 10 min. Then transfer to a 50ml falcon tube and centrifuge at 
3000 rpm for 10 min at4°C. 

(5) Pour the supernatant. Resuspend the pellet in 16ml ice-cold TB. Incubate on ice for 
lOmin, then spin again as above. 

(6) The cell pellet was gently resuspend in 4 ml of TB supplemented with 1% DMSO. 
30 Incubate on ice for lOmin, then dispense 0.5ml aliquot and immediately frozen by 

immersion into liquid nitrogen. The tubes are stored in -80 °C for further use. 



15 



LB + Ampicilin (50 ug/ml) agar plates; 

TB media (lOmM Pipes, 55mM MnCU 15mM CaCl : and 250mM KCl), all 
the components except for MnCU are mixed and the pH is adjusted to 6.7 
with KOH.. Then, MnCl 2 was dissolved, the solution was sterilized by 
filtration through a 0.45u filter unit and stored at 4°C. All salts were added 



as solids. 
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(IV) Co-integrate formation and identification through Southern blot analyses (4 days)- 
I . Transform the competent BAC cells with the Ts shuttle vector, using 10 ul of the 
midiprep DNA and 200ul BAC containing competent cells. Transformation is done as in 
(IV) of part H. Plate 1/10 of the transformed cells onto Tet + Chl plates, and grow 
5 overnight at 30 "C. 

2. To generate co-integrates, single colonies (up to 6 in total) is picked up with a sterilized 
metal loop and diluted each into 1ml LB. Vortex to disperse the bacteria in LB. Plate 
lOOul LB + Bacteria on to two Tet + Chl plates. Incubate one at 43 «C incubator, and 
incubate the other at 30 °C overnight. 
10 3. A thick lawn of bacteria will grow on the plates incubated in 30 "C. For the plates 
incubated in 43 "C, only dozens of individual colonies will grow on top of a hazy 
background of very small satellite colonies. Pick 20 of these large colonies, inoculate each 
colony to 2 ml LB supplemented with tet (10ug/ml) and chloramphenicol (12.5 ug/ml), and 
streak the same colony onto a tet + chl plates. Grow the miniculrure with rigorous shaking 
15 at 43 °C overnight. Incubate the master plate at 43 °C incubator overnight and stored in 4 
"C for further use. 

4. Make miniprep DNA from a 1.5 ml miniculrure using standard alkaline lysis methods. 
Dissolve the DNA in a 30 M 1 TE and use 5-10 M l of the DNA for restriction enzyme 
analysis. 

20 5. Restriction digest with appropriate enzymes and analysis of the co-integrate by Southern 
blot. Due to the high efficiency of co-integrate formation even with 500bp homology 
( > 10%). I usually only analyze co-integration on one homology side (either A or B). For 
example, to analyze co-integrate on A side, use fragment A as a probe and digest the BAC 
DNA with an enzyme that will detect the co-integrate formation on A side (such as EcoRI). 

25 Standard southern blots are done to reveal the co-integrates. As controls, the original BAC 
and the shuttle vector should be included in this analysis. The reason to use the homology 
arms as Southern blot probes is that it will hybridize to two bands of appropriate size in the 
co-integrate BAC. As controls, the original BAC and the shuttle vector should be included 
in this analysis. 
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(V) Resolution and Southern blot analyses of correctly resolved BACs (6days): 



I . Once the co-integrates are identified, a purified single colony of the co-integrate from 
the Tet-f Chi plares grown at43°C is picked and streaked onto a Chirorampenicol plate 
.( 1 2.5ug/rnlt) to grow single colonies. 
5 2. Incubate the Chi plate at 43 : C overnight, to allow some bacteria to resolve and to lose 
the temperature sensitive pSVl plasmid, and hence lose the tet resistance gene. 

3. To select for tet sensitivity in the resolved BAC ? 8 to 16 single colonies from the Chi 
plate are picked, and streaked onto Fusaric acid + Chloramphenicol plate (2 to 8 individual 
colonies can be streaked onto each plate). Two controls can be done to test the 

10 effectiveness of antibiotic selection of the FA-f-Chl plates: one is streaking a Tet-resistant 
colony (from the Tet + Ch! plate), and the second is a tet-sensitive colony (from the plate 
growing the original BAG). Another control can be done is to streak the co-integrate 
colonies on just Chi plate (without fusaric acid). 

4. Incubate the FA + Chl plates at 37 °C for 2-3 days. A long incubation time is necessary 
15 since the resolved colonies grow very slowly due to the presence of the fusaric acid. Tet 

containing colonies should not grow even in 48 hrs incubation. Therefore, there should be 
much fewer colonies on the Chi + Fusaric acid plates than on the Chi plates. These colonies 
are the resolved colonies. 

5. A) Two alternative methods can be used to identify the correctly resolved BACs. If 
20 both A and B homology are about the same length, one can just pick 10-20 colonies, 

prepare miniprep DNA by alkaline lysis and do Southern blot to analyze the targeting 
events. About half of the resolved BACs should contain the correctly targeted marker 
genes. B) If the two homology arms are not the same length (>500 bp difference), one 
should use the colony hybridization to select the correctly resolved BACs. Pick 50-100 

25 individual colonies from FA + Chl plates, streak them onto Chi plates and also onto the 
Tet-hChI plates, as a control for Fusaric acid selection. Each plate can accommodate 50 
test colonies and two positive control colonies, which are the co-integrate colonies from the 
Chi plate. Grow the colonies overnight at 37 °C. Abundant colonies should grow on the 
Chi plate, and none on the Tet-f Chi plate, except the positive co-integrate controls. The 

30 selection for tet sensitivity at step 4 is very stringent and has essentially no background. 
Therefore, all the colonies that grow on FA + Chl plates have been found to contain 
resolved colonies. Colony hybridizations is performed, according to the standard protocols 
[Sambrook et al., (1989) supra], to select for the colonies that are resolved and resulted in 
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targeted mod.fication. The colony hybridization probe should be part of the recombination 
cassette excluding the arms, such as lacZ. Neo. GFP or polyA sequences. 

6. Midi-prep DNA are prepared for the positive clones by the alkaline lysis method as 
described above. Restriction digests and Southern blots are performed to confirm targeting 
event on both homology side (A and B). 

7. Pulse field gel analyses should be done to confirm the modification event and to 
determine if there are any rearrangements in the modified BACs. Since there are two Not I 
sue flanking the BAC insert (Research Genetics), digestion with Not I should reveal the size 
ot the modified BAC. Generally Mlul. Pad and Pmel sites are included in the 

) recombination cassette. Digestion with these enzymes will confirm the targeting events 
Double digestion with these enzymes and with Not I will help to determine the integration 
sue of the recombination cassette in the BAC. Xhol is usually used to fingerprint the 
modified BAC, since it has a wide distribution of fragment sizes. Comparing the Xho 
d.gestion pattern of the modified BAC with the original BAC will reveal anv gross 
rearrangements in the modified BAC. Other enzymes, such as BamHI and Avrll can also 
be used for this purpose. Targeted BAC modification has been found not to introduce any 
unwanted rearrangements into the BACs. Probes used to hybridized to the PFGE blots 
include: insert specific probes (s.a. lacZ. PolyA. GFP and Neo) and whole BAC probe (to 
reveal all the digested bands from the BAC). Once the modified BACs are confirmed to 
have the specific targeted modification events and the lack of rearrangements, these BACs 
are ready to be used for the biological experiments, such as producing transgenic mice or 
transfecting cells. 

4 ' Preparation of large quantify Wwh duality lin Pa H , ed BAC DNA f or n rnn„n^ r 
injection 

(I) Maxiprep BAC DNA preparation^ day): 

See the isolation and initial mapping of BACs section above. 

(II) Prepare intact linearized BAC DNA for pronuclear injection (1 day): 

1- Digest 50ug cesium banded BAC maxiprep DNA overnight in 500ul total volume 
containing: 

50 fig DNA 

50 fil 10X NotI buffer or Buffer 3 (NEB) 
50 Ml 10XBSA 

12.5 nl lOOmJvl Spermidine (Final concentration 2.5mM) 
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25 M l (250units) Not I (NEB) 
ddH20 to 500ul total volume 
Digestion is carried out at 37 °C for > lOhrs. 
2. Preparation of the CL4b Column (performed at room temperature): 
5 Take a 5ml plastic pipette, air-blow the cotton to the tip and clamp the pipette on a stand. 
Shake the CL4b sepharose (Phaniacia) well, and gradually add the sepharose into the plastic 
pipette. Add until the packed sepharose to almost the top (with about 1ml space to spare). 
Never let the column dry. 

3. Once the column is ready, -use a 10ml syringe to set a reservoir on top of the column 
10 (buffer is added to the reservoir). Then equilibrate the column with 30ml of the injection 
buffer (lOmM Tris.HCI,pH7.5. O.lmM EDTA and lOOmM NaCl). This takes about 2-3 
hours. 

4. Now add 5ul 10X DNA dye into the 0.5ml digested BAC DNA. Take the reservoir out 
and gently add the DNA(-hdye) onto the top of the column with a pasteur pipette. Wait 
15 until the DNA+dye just goes into the column, gently add 0.5ml of injection buffer on top 
of the column. 

5. Once the injection buffer almost goes in, the reservoir is put back with 10 ml of injection 
buffer in it. Now start collecting 0.5ml fraction with a 24 well plate. Generally about 12 
fractions are collected (or until the blue dye is almost at the bottom of the column). 
20 6. Run 50ul of each fraction on a pulse field gel to identify* the appropriate fractions. The 

bands should be visible after ethidium bromide staining, A Southern blot is performed in 

order to choose the fractions with highest yield, and the least degradations. 

7. Purified DNA is stored at 4 P C. It is stable for weeks (e.g. . no degradation was detected 

after 3 weeks). 



Results 

BACs are useful as tools for studying the regulation of gene expression in vivo. In one 
particular example, a BAC can include the murine brain specific zinc finger gene, RU49 
[Yang et a/., Development 122:555 (1996)]. RU49 has been shown by in situ hybridization 
to be expressed in the granule cell population of the murine cerebellum, the dentate gyrus 
and the olfactory bulb in the brain. However, proper expression of the lacZ marker gene 
could not be obtained in the cerebellum with a 10 kb RU 49 promoter-iacZ construct in 
transgenic mice, e.g.. only one out of ten lines showed partial expression in the cerebellum. 
To overcome this problem, an homologous recombination based method for inserting an 
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IRES-U,cZ marker gene into the BAC containing RU49 was developed. The germline 
tran S m,ssion ,n transgen.c mice of an intact modified BAC and proper expression of the lacZ 
rransgene m the cerebellum is demonstrated. 

To modify BACs ,n E.coli. a temperature sensitive shuttle vector based svstem for 
, homologous recombination was employed [O'Connor « a/.. Sconce 244: 1 307- 1 3 P ( 1 989)- 
Hamilton etai.J. BacieHoL ,71:4617(1989),. This temperature sensitive plasmid win 
rephcate ,n cells growing at the permissive temperature (30*0. but will be lost in cells 
growing at the restnct.ve temperature (42-44°C) because ,ts origin of replication can not 
function at the restrictive temperature [Hashtmoto-Gotoh et al.J. Bacterial. 131 405-4 12 
0 (1977)]. To overcome the recombination deficiency of the BAC host ,.e.. a RecA' host ceil 
the E.coli recA gene was introduced into the temperature sens.t.ve shuttle vector When 
transformed with the temperature sensitive shuttle vector (carrying a recombination cassette 
containing the recA gene) the host strain becomes conditionally competent to perform 
homologous recombination allowing /„ vivo modification of the resident BAC. 

The genera, strategy for targeted BAC modification is show, in Fig. 1. which iHustrates the 
steps involved ,n inserting a marker gene, e.g., IRES-lacZ-pGK polyA (ILPA), ,nto the BAC 
F.rst, two small genomic fragments, e.g., A and B. each containing greater than 500 basepairs 
of a gene of interest are cloned into the building vector (pBVl) in appropriate order and 
or.entat.on to generate the recombination cassette. The recombination cassette is then 
transferred .nto the temperature sens.t.ve shuttle vector (e.g. , pSV/.RecA). The reason the 
recomb,nat.on cassette is not built directly in the shuttle vector is due to the relative 
difficulty in manipulating its DNA, due to low copy number [Bochner et ai. J. Bacterial 
143:926 (1980): Maloy et «/., Bacteria!. 145: 11 10 (1981)] and large vector size (, Ikb). 

This shuttle vector is then transformed into E.coli containing the BAC. The transformants 
can be selected by tetracycline resistance (carried by pSVl.RecA) and chloramphenicol 
res,stance (earned by the BACs) at 30°C. Since the shuttle vector also carr.es the recA gene 
homologous recomb.nat.on can occur between the shuttle vector and the BAC. through either 
homology at A or B to form co-integrates. The co-integrates are selected by growth on 
tetracyci.ne and chloramphen.col plates at 43'C. Th.s temperature ,s non-permissive for 
shuttle vector replication, so that the non-integrated, free shuttle vectors are .ost, result.ng .n 
the se.ect.on for bacteria carrying the integrated shuttle vectors, (either into the BACs or into 
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the bactena, chromosomes). Correct BAC co-integrates can be .dentified by Souther, blot 

analyses. 

The c„.,„,e g ra,es are ,„e„ rcstreakcd omo cHoramphc „ j(;oI p|a , cs ^ ^ ^ ^ ^ 
ovem.ght. A fraction of the co.,„,egra,es win ..,„ d ergo a seco „ d r< ,. omD , na „„„ evem 

* 'r 0iU "° nl - ,h ™ USh e ' ,hCT h °™"^ « A « B The resolved BACs wfl, automatical lose 
■he ,„ and ,he genes, since the excised shu„i. vector plasmids cannot replicate at'the 
non-permtssive temperature. The resolved BACs con be selected bv crowm. on 
cmoramphen.col and Il)Sar , c acld p|at „ „ 37 „ c ^ ^ ^ ^ ^ ^ ^ ^ 

-he loss of tetracycline resistance. ,. t ,, counterselecng against BACs that ate resistant to 
.eu-acyc„„e. As illustrated in Figure ,. depending on which pair of homologous fragments 
undergo the second recombination even,, the resolved BAC can be either the orttttna", BAC or 
■he prec,se, y modified BAC. The desired clones can be identified by co!o„ y hybridization 
using a labeled probe for .he inserted marker. One tmportan, aspect of the method ,s tha, the 
_ «tf gene ,s o„, y temporally introduced ,„,„ the bacteria, host. Once the mod,f,ca„„„ ,s 
o nmshed. the bacteria w,l, automatically lose the r ecA gene, renaming ,„ the recombination 
deficient state suitable for stable maintenance of the modified BACs. 

This strategy termed targeted modification of BACs. was tested b y introducing the IRES- 
lacZ- P o ly A ( M , marker into the ,3, kb murine BAC 169 conta.ntng the M49 tocus (Fig 
2 A). In ,h,s case, the marker gene ,„ the firs, coding con of the RU49 gene was ,arge,ed 
>v,h homology fragments being 1 kb and 1 .6 kb respectively <F,g. 2B,. Placing the ,RES 
sequence before the lacZ gene ensures the translation of the marker «„e even when la cZ 
gene ,s placed after the translation star, s,,e [Pelletier . AW C 334:320 (1988)] The 
pSV,Rec.4 temperature sensitive shuttle vector containing the recombination cassette was 
transformed into the DH10 E, oli strain containing the BAC 16i > and selected b y growth a, 
Cher 30°C or 43'C on plates containing chloramphenicol and tetracvcl.ne. In contrast to 
S r„w,h „ 30-c. which produced a thick law. of rransformed cells, growrh a, 43'C resulted 
■n growth of indivtdual colonies. Twenty of these were picked and tested by Southern blots 
for co-tmegration of the shurtle plasmid .„,„ BAC169. As show. i„ Fig. 3B analysts of 
twenty clones using the Bl fragment of the RU49 homology cassette resulted ,„ the 
■dentificatton of two clones containing the appropriate 4 and 8 kb EcoRl bands (10%, 
mdicating tha, these dones carry co-integrates tha, have occurred through ,h,s region of 
homology. 
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The co-.ntegrates are then resolved as described above by growing the cells first on 
chloramphenicol plates at 43X and then on chloramphenicol and fusanc acid plates at 37°C 
Fusanc ac,d provides a srrong counterselecuon against bactena containina the tetracycline 
re S1 stance gene. Indeed. 200 colon.es p,cked from these plates were all tet sensirive ' 
^ md.catmg the stringency of the selection. Duplicated colon.es erow.nu on the 

chloramphenicol plates were used for colony hybridization w.th the^o/,, orobe Ei.ht 
out ot 200 colonies were positive (4 Southern b.ot analyses us.ng either homology at -U 
or B 1 as the probe showed that all these clones contained correctlv resolved BACs (F^ 3C 
and 3D). Three BACs (lanes 4.5 and 8) also contained wild type bands, which may represent 
ether con:am.nat,on from other Cones, or a BAG conta.nmg two cop.es of co-.ntegrates that 
resolved through two different homologous regions. 

The next step ,n our analysis was extensive mapping of the modified BACs to determine 
whether any unexpected deletions or insertions were generated dunng the modification 
procedure. Fig. 4 shows pu!se d fie.d gel mapping of the modified BAC L 1 and U and the 
* or, g ,na, BAC 169. The same filter was probed separately with the whole BAC169 probe 
wtth a probe from the inserted marker gene (p^olyA) and a probe from the 5' non-modified 
region of the RU49 gene (A2). BAC169 probe (left pane.) hybrid.es with .„ the restriction 
fronts for each BAC. Thus. XHol digestion reveals a finger print of the modified BACs 
showing that essentially a„ fragments are preserved. The only difference ,s that the fragment 
containing the I LP A insert ,s slightly smaller than the corresponding w„ d npe fragment due 
to the replacement of the 7 kb RU49 fragment with the 4 kb marker gene (Fi B >B, 
Digestion with Ar«, which releases the entire BAC insert, also reveals a slightly smaller 
DN A msert in modified BACs for the same reason. Since the marker gene Jas engineered to 
carry an additional /W site (Flg . 2) , dlgestion of the BAC u ^ ^ ^ ^ 

enzyme resuits in the generation of two fragments. ,n contrast to the s.ngle fra^ent seen ,n 
the original BAC169. The s.zes of these fragments a„ow the determination that these BACs 
contain approximately 75 kb 5< to the PmeI site . and 53 kb 3" to it (Fig. 2). No apparent 
rearrangements have occurred during the mod.ficat.on procedure. 

To confirm this conclusion, the modified BACs and BACI69 were probed with both a 
marker specific probe {pgkpolyA) and a prohe near (he promoter region ^ ^ ^ 
mooification region ( A2). Consistently, both nulled BACs contained a single band 
Homologous to the marker gene probe which ,s not present ,n BAC169. VV^en the A2 probe 
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was used, a single band of expected size appeared ,n all three BACs. Additional 
fingerprinting of all eight modified BACs with Hlndlll. EcoRl and ^digests showed that 
no detectable rearrangements or deletion existed m these BACs. Thus, the temporary 
introduction of the recA gene into the BAC host strain does not introduce any rearraneements 

:> or deletions. 



To test the reproducibility and reliability of the targeted BAC modification, the BAC L 1 was 
further modified by replacing the IRES-lacZ sequence w.th pgk-neo sequence. In this case 
homologous fragments of about 500 bp each were used. The modified BACs were also 
efficiently obtained and shown not to have any rearrangements or deletions. Therefore, 
targeted BAC modification is a simple method to precisely modify BACs without 
introducing any unwanted changes in the BACs. 



To demonstrate the possibility of us.ng the modified BACs for /„ ,i,o stud.es for gene 
expression and gene function, transgenic m.ce carrying the modified BAC169 with the IRES- 
LacZ insertion were generated. To purify the 128 kb BAC insert for pronuclei injection 
1 > several established methods for purifying large YAC DNA were attempted, and resulted in 
considerable amount of DNA fragmentation. In contrast, when a simp.e ge. filtration column 
filled with SEPHAROSE CL-4B was tried, very pure fractions of intact linear BAC DNA 
.nsert were obtained in an appropriate injection buffer, e.g., 100 mM NaCl. 10 mM Tris HC1 
PH 7.5 and 0.1 mM EDTA (Fig. 5A). Unlike YAC DNA purification which typically results' 
20 ,n a low DNA yield, the purified fractions using the SEPHAROSE CL-4B column contained 
a large quantity of high concentration ..near DNA 0.5 mis of 3 ..g/m. DNA or more) 
The purified DNA could be directly visualized with ultraviolet light after ethid.um bromide 
staining. The SEPHAROSE CL-4B column could also efficiently separate the degraded 
DNA (,„ this case in fractions 3-6) from the pure linear DNA (fractions 7-9) (R. 5A) 
2> Fraction 8 contained 3 ,gml DNA and was used directly for pronuclei injection. 

Pronuclear injection into the fertilized C57BL/6 mouse zygote is performed according to a 
standard protocol [Hogan « ai, ,n Manipulating the Mouse Embryo (Cold Spring Harbor 
Laboratory Press, New York, 1986)]. Two different concentrations of fraction 8 BAC DNA 
(obtained as described above) were used: > . . . g „„ and 0.6 ,<g/ml. No newborns were 
30 obtamed with the high concentration DNA. s U „ that the high concentrations may be 
toxic to the zygote. .However, with the lower concentration of pure linear DNA, 15 newborn 



ISDOCIO. <WO 9659C6CA1 J_> 



10 



WO 98/59060 

PCT/US98/ 12966 

53 

mice were obtained and two of them (\ "?<>/ i \r-> *™ 

ana wo or tnem (13/6), Y7 and Y9, contained the IacZ marker gene as 

demonstrated on a Southern blot fFicr iFn tt,„ 

° 0t (h,g - 5B) " 1116 intensity of the bands allows an estimate of 
2-3 transgene copies for Y7 and one copy for Y9 

To determine if the ,ntact BACs have been integrated into the genome, the oresence of both 
ends ot the BAC ends was assayed for in the transgenic mice. Since both BAG ends contain 
some vector sequence. PGR primers specific to the vector sequence were generated and used 
to amplify the transgenic DNA. The amplified products were then probed wnh a third 
labeled oligonucleotide probe within the amplified region. As show. , n Fig. 5C and Fi* s D - 
V3. ,7 and Y9 have both ends present, whi.e the negative controls do not. Since Y7 and Y9 
also have the / flc Zgene. they are l.keiy to contain intact BAC transgenes. For Y3 wh- 
has both ends ,t does not contain the lacZ gene. Th.s may be due to either a rearranaement 
fragmentation during the injection prior to integration. 



reas it 
or 



) 



The Y7 transgenic m.ce also gave rise to germl.ne transmission after breeding with B6/CBA 
rn.ee. In two litters having a tota. of eight pups, three pups carried the LacZ transgene (F, g 
• 5E). Further analysis demonstrated that the transgene was transmitted in a Mende.ian 
distribution to more than fifty Y7 offspring. 

Next the expression of lacZ gene i„ the cerebellum of the Y7 transonic mice was 
determined by whole mount IacZ sta.n.ng. RU49 1S normaliy expressed .„ ^ ^ 
of the cerebellum, the dentate g>TUS and the olfactory bu.b (including the subventncu.ar 
zone, the rostral migratory stream, and the olfactory bulb prooer) [V.„ B e . „, 
.22.55,066 ,,9,6,,. ■„ previous studies. RU49 promo,er iacZ transgenic m,ce wi,h 10 kb 
_ had been generated. Howeve, a„ of lhe Mnsgen , c lmes showed 
ffecsi Cher ,he y d,d no, express ,„ ,he bra,„ ., a„. or ,„e y were ec,op,ca„ y expressed in 
-* ««. b», no, ,„e cerebel.um. One particular 10kb-,.cZ rrans 8 e„,e ,iPe did show 
rested express.cn ,„ th e cerebeHum. however. ,he expression was r e sm «ed , 0 thc caudal 
ha.fof.be cerebeHun, W,,h , 2 8 kb of RU49 endogenous seouence surrounding ,be ,»c2 
gene ,„ ,he V7 hue, a, postna,a, day 6. .he transgenic m,ce showed a ,acZ expression pa. t ern 
close ly resembhng the endogenous expression pane™ (Fig. 6, ,„ ,he C erebe„um, ,be marker 
gene ,s expressed .hroughou, ,he cerebeHum (Fig. 6A, and no expression ,s seen ,n five 
~n.ro, itoerma.es (Fig. 6B). Funher analvs.s showed .ha, ,he rransgene is expressed a, h,h 
■eve, ,n ,he EGL and iower ieve, ,„ ,he IGL. The ,acZ marker gene ,s a,so expressed in .he" 
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~ e , ne pos „, ona| tffects Md confe r ^ properesprsssion a{RU49 
> ">™ r '«""s U s,„gco„v e „ ti o„ 2 l ra „ sge „ lcconstnicts """"" 

As rau SllI neren, bacKria , based a „ ificial chromosomes <BACj ^ ^ ^ 
~,„ g ,a, gc DNA f „ r gene urgeli „ s . As ^ ^ ^ 

n^ncancn m eU,o d . BAC. a „ d PACs can b e read „ y modified , 0 inIroduce ^ ^ 

-™ um e as maki „ g . convemiona , , arseling consmci (m momhs> Moreover bb 
consIrucl DNA can be easi.v iso la ,ed in milligram 0 „ aMlty and „ ich qua 

;::™:~ r the yac sys,em - sm - - - diffa " • *- — — 

T* pr escn, i„ve„„o„ ,s no, «o be „ miKd in scope by the sp „ ific tmood , mems descnbe 
here.n. ,„ deed . v , rious modiflcalio „ s „ f ^ ^ ^ ^ ^ 

wit b eco,™ apparew I0 those ski , |ed ,„ , he m from ^ foresoiiig desCTiption ^ 

aceo mp an y ,„ 6 fi gure , Such mcMcaUons are imended to fa|1 „,,„,„ ,„ e 

appended claims. 



20 o'/iirr ,o be understood ,ha: an tase s,zk - ° mm ° acid — - - 

or mo ecu ar raass values , glvc „ for „ uc|cic acjds of poiw . (ta apprMimate 

proviuea for description. 

Vanous pufc ,,ca,,o„s ar e c,<e d „e re ,„, ,„e disclosllres of whlch are ^ 

in their entireties. 
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WHAT rsrr m KFP r< . 

I AmethodofseleCvCy peeing homologous recomb.nat.on on a particular 

- nuceo-de seance conta.ned in a recombmat.on deflcient _ ^ ^/J"* 

; eel, ,Z n lh ;r d T S 3 reCOmb ' nat,0n CaSSeHC ,nt ° ^ — Host 

5 into * e ™l r T ,on cassette ~ a — «*■ - — y „ 

reconb, ; ; e " "** ^ h ~ » ^ - support Ho.ologous 

-cm ma on . and w , erem (he host cannof md£pendemiv 

recombination: anH 6 



6 

7 recombination: and 
8 

9 



' L ri The , me ' h0d 0rC ' afm ' Where ' n ^ deficient Ko St „« cannot 

- .^Pcnd^y SupP „ n homologous recombmation because - 

■nduc.ng ,„e transten. expression of a RecA ., ike pr0 , em ,„ ^ ^ ^ 

' L ^ 7' h ° d ° f C ' aim 2 ' Wherem mdUCin5 ^ «P«*» °f - *«A„ k e 

• pro,e,„ , s performed w,,„ „ cond ,„ onll ^ ^ 

1 nC ""^ ° f Cla ' m 3 •*•"•» «« ««««<-« shuMe vector ls . 

™pera W re se„s,,,ve shuttle w (TSSV) , hal rep|kates m a 

does no. replica,, a, a non-permissive temperature. 



(0 transforming the host cell with the TSSV ; 



at a permissive 

emperature, wherem the TSSV contains a RecA - llke protein gene ^ j§ 
the host cel. and supports the homo.ogous combination between the nuci 
and the particular nucleotide sequence: and 



in 

eic acid 
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7 
8 



(iO growing the host cell at a non-permissive temperature; wherein the 
TSSV encoding the RecA-Iike protein is diluted 



out. 



4 



_) 
4 
5 
6 
7 



1 6. The method of Claim 5 herein the permissive temperature is 30= C and the non- 

permissive temperature is 43 ~C 

1 7. The method of Cairn . wherein the particular nucleotide sequence ,s contained ,n an 
- mdependent origin based cloning vector (IOBCV) that is comprised bv the host cel. and 
< where,n neither the IOBCV a.one. nor the IOBCV ,n combination with the host eel/ can 
independently support homologous recombination. 

8- The method of Claim 7 wherein neither .the IOBCV alone, nor the IOBCV , n 
combination with the host eel,, can independently support homologous recombination 
because both the IOBCV and the host eel, are Reck, wherein inducing the host cell to 
transiently support homologous recombination comprises inducing the transient expression 
of the RecA-hke protein to support homologous recombination ,n the host cell; and where.n 
•nducmg the transient expression of the RecA-I,ke protein ,s performed with a conditional 
replication shuttle vector that encodes the RecA-like protein. 



1 9. The method of Claim 8 wherein the conditional replication shuttle vector is a , 
I temperature sensitive shuttle vector (TSSV) that replicates at a permissive temperature but 
> does not replicate at a non-permissive temperature. 

10. The method of Claim 9 wherein the RecA-like protein ,s controlled by an inducible 
promoter; and where,n the transient expression of the RecA-hke protein ,s achieved by the 
transient induction of the inducible promoter in the host cell. 

11- The method of Claim 9 wherein the IOBCV 1S a BBPAC and the host cell is a h6st 
bacterium. 



12- The method of Claim 1 1 wherein the TSSV also contains the recombination cassette 
and a first gene that bestows resistance to a host cel. containing the TSSV aeainst a first toxic 
agent, where.n said first gene also can be counter-selected against, and where.n the 
recombination cassette, the RecA-like protein gene, and the first gene are linked together on 
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5 the TSSV suc h that when the nucleic acid in,^ , - 

6 the R ecA -i ikeprolein m and t , ; mtegrateS ,nt ° the ""circle seau cnce , 

7 ~ ndthefirStgene remain '^ed together, but neither the 
^ ~ ^ «"< •« — to -ntegrated nude, ac.d 

1 ^- The method orciaim P fnrfh^ 

fra**., 3* erHwmH .. ' 5 nUde ' C aC,d ' a " d P"*"* . «»nd g e„ omic 

« ^zzrr c panicu,ar ™ cieo " de «~ - — *, r , m 
™ M or t ,;;;~ , r ,on of ,hc mr - <■ - » 



1 14. The method of Claim 13 wherein tu 

' p~ m . buiIdm „ veclor n ; e ,hs r mms ° f ,he recomb '— * 

3 the TSSV. -co mb ,„a„o„ ca SKne ,s su bseqi „ nt , y , o 



1 15. 

7 



15 - The method of Claim 13 wherein the BBPAr ™ . 

---«.o, h e h „ s , cenasarastaseco „;rrz:: nsasec<>nd8enetha,bestows 

' .f -He «« tM ,o lt„. a „rr"" S 7 hOS " Ce " S " 8 Pem,,SS ' V ' ' emP ™ re *« 
'0 .he par«to lar nucleot , de ,WWMn 'ecomb,„ at ,„ n cassetle and 

( "' mcubil,in S transformed host cells a, „ „„„ 

'4 ce„s c„„,a,„, ns . Tssv ^ *" ^ ^ ^ ** ^ ~* h„ SI 
■5 rcp„c at e ; graK ^ SeltC,ed f ° r ' - TSSV c ann01 
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(iv) incubating the host cell* ™„t • 

21 cassette and the partis. , ^ recornb '™'°n 

^nu cr.e particular nucleotide seauenr^ th- M - 

K par,, ul „ „ ucleoIlde '""^ fl ""™ »««™"-» *« nuddc acd ,„ t0 the 

^ sequence and forming a resoIveH rod \r ■ i_ * 
^ 0 resoued BBPAC m the host cell- and 

^ TV method of a,™ ,5 wherein ,he ^ temperature , s 

2 permissive temperature is 43'c. n0 "' 

* I7 - The method of Claim 1 s fur+u* 

2 resolved BBPAP k C ° mPnSmg 3 H ° St CC " ^"ing a 

1 '8. The method of Claim 15 wherein »h„ n . 
' -e,nthe C o«tm ga ^ 

1 19- The method of C^m , 5 .herein the RecA-, lke pro te,n ,s recA. 

1 20: The method of Claim 19 wherein the TSSV is pSVl R Pr 4 h ■ u 

2 97968. P*r I- RecA having the ATCC no. 



21- The method of Claim S wherein the RecA lit,. , ■ 
^ P^oter; and wherein the rrans.ent * " « 

3 - S1 ent .nduct.on of the .nd.c.b.e promoter m °^J^ * ~ * *e 

1 22. The method of Chum 21 where.n the IOBCV IS a BBPAC and the k 

2 deficient host cell is a bacterium. recombmation 



1 
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1 24. The 

~ ^.iaim wherein th» d=„ a , , 

' protein is recA. 

I 25 



meiho -ofClai m 22 wherein the RccA-likc, 



against. 



" C0 " d ' ,,0nai rCP,Xa,i °" ^ — — a RecA-like p rott ,„. 

* L. vi:;;;:;;:' rep " canon shut,,e — ° r a - * -« - <■ — ^ 

• ^ xssv ore,,,. 26 , hich coma ,„ s . scne iha( cm ^ 

' 2S - Th5TSSV0fcla ' m2 ---"-~ stma , vd , neres , stance 

TCMm 29 ,ha, i.^,.^ havi „ 8 , he ATCC ^ 9?96s 

3 R=CA- „„ s , CCI , 1? ' 8 COnd, "°"" rePli — *«>. v«.or ,„ a 

4 pro*,, ' '"'^'^""""'"'"—^^vee.o.en^^a^,,,^ 



ve 



protein is recA 

1 3 <>- TheTSSV 
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FIG. 1 

Construction of the recA(+) and temperature sensitive shuttle vector 

1) Clone two small genomic fragments (> 500 bp each) into the building vector (pBV1) 
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FIG. 2A 

Map of the RU49 Genomic Locus within BAC169 
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FIG. 2B 

Map of the Modifified BAC 169 with an IRES-LacZ-PolyA Insertion 
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Hypothetical map of a gene of interest^ averted BAC (>150kb insert size) 
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FIG. 7B 

First targeted modification to introduce the positive selection marker gene 
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Second Modification to delete the promoter of the gene and to generate the short arm 
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FIG. 10 
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